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We investigate the relationship between finite terms in A| e trec, the lambda calculus with 
letrec, and the infinite lambda terms they express. As there are easy examples of infinite 
A-terms that, intuitively, are not unfoldings of terms in A| etrec , we consider the question: 
How can those infinite lambda terms be characterised that are A| etre c-expressible in the 
sense that they can be obtained as infinite unfoldings of terms in A| etr ec? 
For 'observing' infinite A-terms through repeated 'experiments' carried out at the head of 
the term we introduce two rewrite systems (with rewrite relations) -> r cg and -* r cg+ that 
decompose the term structure, and produce 'generated subterms' in two notions. Thereby 
the sort of the step can be observed as well as its target, a generated subterm. In both 
systems there are four sorts of decomposition steps: -^A-steps (decomposing a 
A-abstraction), ->-® - and -^©j-steps (decomposing an application into its function and 
argument), and respectively, -^ dc i-stcps (delimiting the scope of an abstraction, for ->reg), 
and ->s (delimiting of scopes, for ^ rog + ) . These steps take place on infinite A-terms 
furnished with a leading prefix of abstractions for gathering previously encountered 
A-abstractions and keeping the generated subterms closed. We call an infinite A-term 
'regular'/'strongly regular' if its set of ^ r0 g-reachable/^ reg +-reachable generated subterms 
is finite. Furthermore, we analyse the binding structure of infinite A-terms with the 
concept of 'binding-capturing chain'. 

Utilizing these concepts, we answer the question above by providing two characterizations 
of A| etrec -expressibility. For all infinite A-terms M, the following statements are equivalent: 
(i): M is Aietrec-expressible; (ii): M is strongly regular; (iii): M only has finite 
binding-capturing chains. 

1. Introduction 

A prevalent enrichment of the A-calculus is the extension by a fi or letrec construction, the 
latter being a generalisation of the first. Such constructions allow for a finite representation 
of infinite A-terms, which for typed A-calculi (and thus for most functional programming 

t This work was supported by NWO (Nederlandse Organisatie voor Wetenschappelijk Onderzoek) in the 
framework of the project Realising Optimal Sharing (ROS) under the direction of Doaitse Swierstra 
and Vincent von Oostrom. 
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languages) is necessary to facilitate recursion and ensure Turing completeness. A term 
in the A-calculus with letrec (which we denote by A| etr ec) is usually understood as a 
representative for the A-term obtained by infinite letrec-unfolding. 

But it turns out that not every infinite A-term can be expressed finitely in A| etre c 
(for an example with an informal explanation, see Example |l.f | below) . On the other 
hand, A| e trec-expressible infinite A-terms have infinitely many A| e trec-representations (cf. 



Example 1.2 1. We say that a A-term is 'A| e trec-expressible' if it has a representation as 



finite term in A| etrec . 

Example 1.1 (not A| e trec-expressible). Consider the infinite A-term of the form M = 
\a.\b. (Ac. (Xd. . . . c)b) a with term tree as shown in Figure[l]on the right. While this term 
tree has a regular structure, the scopes of the abstractions in it are infinitely entangled: 
the scope of Xa reaches into the scope of A6, the scope of Xb into the one of Ac, and so 
on. This feature of M can suggest the idea that it is impossible that M is the result 
of stepwisely 'unrolling' a A| e trec-term in a manner that respects scopes. Such a process 
would namely 'tile' its result in a regular manner with finite term-context tiles having a 
bounded scope-nesting depth such that furthermore the scopes of abstractions contained 
in different context tiles do not overlap. This excludes, intuitively, the formation of the 
infinite entanglement of successively overlapping scopes that can be observed in M. - The 
term M will indeed be recognised as not A| e t re c-expressible. 
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Fig. 1. Termgraphs for Example 1 1 . 1 1 and Example 1 1.2 



Example 1.2 (A| e trec-expressible). The infinite A-term N = Xx.x-x- ... can be ex- 
pressed by Ax. letrec r = x : r in r as well as by Ax. letrec r = x : (x : r) in r (for the term 
graphs of these A| etrec -terms as well as the term tree of N of their infinite unfolding, see 
Figure[l]on the right). The colon (:) used here is inspired from the functional programming 
language Haskell. It is a binary infix operator constructing a list from its arguments by 
putting the first argument (here: x) in front of its second argument. This example is an 
implementation of what is known as the repeat function. The A-calculus we deal with 
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here does not explicitly include operators. Therefore in the context of this work the colon 
can be viewed as a free variable or a closed A-term. 

Note that, if colon is viewed as a closed A-term that contains A- abstractions, then, 
contrary to the term M from Example abstraction scopes in the term tree of N are 
organised in blocks of of bounded scope-nesting depth (for example each occurrence of 
colon then is such a block), and no infinite entanglement of scopes takes place. 

Overview of the paper. Section [2] is concerned with terminology and notation used 
for known formalisms. In Section [3] a rewrite system for unfolding terms in the A-calculus 
with letrec is formulated. In Section [4] we introduce rewriting systems for decomposing 
infinite A-terms into 'generated subterms', and we show some properties of these systems 
in connection to so-called scope-delimiting strategies. Also in this section, we define 
regularity and strong regularity for infinite A-terms employing the concepts os generated 
subterms and scope-delimiting strategies. In Section [5] we adapt the rewrite systems 
for decomposing infinite A-terms and the notions of scope-delimiting strategies to the 
A-calculus with letrec. In Section [6j we develop proof systems that are sound and complete 
for the notions of regularity and strong regularity, for equality of strongly regular infinite 
A-terms, and for the property of a A| e trec-term to unfold to an infinite A-term. In Section [7] 
we examine the binding structure of infinite A-terms (binding-capturing chains) and 
connect to the concepts introduced so far. In Section [8] we establish the correspondence 
between strong regularity and A| etrec -expressibility for infinite A-terms. In Section [9] we 
introduce 'A-transition graphs' of infinite A-terms and of A| etrec -terms as labelled transition 
graphs in which the edges carry one of the four different labels @o, @i, A, and S. In 
Section [TO] we summarize and provide an outlook on possible results that are related, and 
on potential applications of the concepts we introduce. 



2. Preliminaries 

In this section we gather most of the basic known concepts that play a vital part in 
the rest of our paper. Some central notions concerning rewriting are recapitulated from 



(Terese, 2003), while for others we provide references. Some variations of known concepts 
that are tailormade for our purposes are formulated in definition environments. 

We let N = {0, 1,2,.. .} and N + = N \ 0. For a partial function f ;A^ B, and a e A we 
denote by /(a)l the fact that / is defined for a, and by /(a)t that / is not defined for 
a. The domain of / is the set dom(f) := {a e A | f(a)\.} £ A, and the range of / the set 
ran(f) := {f(a) \ a e A, f(a)l} E B. 

For relations R £ A x B and S £ B x C we denote by R • S the composition of R 
with S defined by R ■ S := {(x, z) \ (3y e B)(x, y) e R A (y, z) £ S} E Ax C, and by R* the 
reflexive and transitive closure of R under composition, which is defined by R* •■= UieN R l 
where R° := id A ■= {(x, x) \ x e A} and, for all i e N, R t+1 := R ■ R\ 

Abstract Rewriting Systems. An abstract rewriting system (ARS) is a quadruple 
(A, 3>, src, tgt) consisting of a set A of objects, a set $ of steps, and src,tgt : <& -> A, the 
source and target functions. We will always assume that A n $ = 0. For objects a e A wc 
denote by ^outC ) and by $i n (a) the set of steps in $ that depart (are outgoing steps) 
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from a, and that arrive (are incoming steps) at a, respectively. We say that an ARS is 
finite if its set of steps is finite. For ARSs Ai = {Ai, <&j,srCj,tgtj) for i e {1, 2} we say that 
Ai is a sub- ARS of A2 if A\ £ A 2 , $1 £ $2, and srci, tgt : are the restrictions of src2 and 
tgt 2 , respectively, to $1, which are required to be total functions (this implies that, for 
all 0e$x, it holds that src 2 (0) = srci(0) e A\, and tgt 2 (0) = tgt 1 (0) e Ai. 

Induced sub- ARS. For an object a e A of an ARS .4 = (A,&, src, tgt) we denote by 
(a-») := (A',<I>',src',tgt'} the sub- ARS of A induced by a, where A' comprises only the 
objects from A that are reachable from a by an arbitrary number of steps (or no steps) 
and with $', src', tgt' being the restrictions of $, src, tgt to the objects in A' and to steps 
between objects in A'. 

Bisimulations between ARSs. Let Ai = {Ai, $ i; srcj, tgtj) for i e {1, 2} be ARSs. A 
relation B £ (Ai x A 2 ) u ($1 x $ 2 ), which relates objects with objects and steps with steps, 
is called an ARS-bisimulation if: 

— if a\ B a 2 , then B relates each step from a\ to some step from a 2 {forth condition), 
and each step from a 2 to some step from a\ [back condition); 

— if 0i B 4>2 with 0i : ai -»■ a'i and </> 2 : a 2 -»■ a 2 , then ai B a 2 and a\ B a 2 . 

Labellings of ARSs. Let A = {A, $, src, tgt) and A' = (A',$',src',tgt') be ARSs. 

(i) An ARS-bisimulation L between A and A' is called a labelling of A to A', and 4' 
i/ie h-labelled version of A, if the converse L" of L is a function L" : A' u $' -»■ Au $, 
and if additionally, for all a' e A' and aei with a L a', the restriction L~ |$< t (a') : 
^out( a ') ^out(o) of L" to the steps departing from a' is bijective. 

(ii) A rewrite labelling £ of A to .4' is a pair (L, [} consisting of a labelling L of A to 
A' together with an initial labelling function [ mapping objects of A to bisimilar 
objects of A' . 

Strategies. A history-free strategy for an abstract rewriting system A is a sub- ARS of 
A that has the same objects, and the same normal forms as A. A history-aware strategy 
for an abstract rewriting system A is a history-free strategy for the L-labelled version of 
A with respect to, and together with, a rewrite labelling (L, [} of A. By a strategy for A 
we will mean a history-free strategy or a history-aware strategy for A. 

Remark. Let S be a history-aware strategy of A, and let A' be that L-labelled version 
of A which S is a history-free strategy of. Then § projects to a history-free strategy § 
of A. The projection is defined by L, which induces a local bijective correspondence of 
outgoing steps of related sources of A and A'. Mind that for deterministic S, § may 
become non-deterministic. Furthermore, every rewrite sequence according to § in A' 
projects to a unique rewrite sequence in A (which is a rewrite sequence according to §). 

The last mentioned fact makes it possible to speak, for a given rewrite labelling, of 
rewrite sequences of a history-aware strategy on the objects of the original ARS. Let S 
be a history-aware strategy for an ARS A, and a an object of A. Suppose that S is a 
sub- ARS of the L-labelled version A' of A for some rewrite labelling (L, I) of A. Then 
by a rewrite sequence of § on a (in A) we will mean the projection to A of a rewrite 
sequence of S (in A') on the result [(a) of the initial labelling applied to a. 
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Rewrite relations: notation and properties. For the single-step rewrite relation 
induced by an ARS we use the arrow symbol -»■ possibly subscripted with appropriate 
names. Let -> £ A x A be a rewrite relation. We denote by -» the many-step rewrite 
relation induced by by which we mean the reflexive and transitive closure of By -> + 
we denote the one- or-more- step rewrite relation of the transitive closure of By ->- = 
we mean the zero- or- one- step rewrite relation of the reflexive closure of By a normal 
form of -»■ we mean an a e A such that there is no a' e A with a -*■ a'. By -> ! we mean the 
reduction to normal form rewrite relation induced by the restriction of -»■ to a relation 
with the normal forms of -> as codomain: -> ! = {(a, a') \ a -» a', a' is normal form of ->•}. 

The rewriting properties below are reformulations, and some are slight variations, of 
known properties of rewrite relations. 

Definition 2.1. Let -+2, -*-3 be rewrite relations. The rewrite relation ->i is called 
co final for ->2 if ^2 £ • *-2- We say that -»i is co final for -+% with trailing -^3-steps if 
-»2 £ • * - 3- Furthermore we say that ->-i factors into -*■% and -+3 if -»i £ -> 2 • -> 3 . 



We will several times use the following specific version of Konig's Lemma. 

Konig's Lemma. Let G = (V, E) be an undirected graph with set V of vertices and 
set E of edges. Suppose that G has infinitely many vertices (V is infinite), that it is 
connected (for all vertices v,w eV there exists a path in G from v to w) and that every 
vertex has finite degree (it is adjacent to only finitely many other vertices in G). Then 
for every vertex v e V, G contains an infinitely long simple path from v, that is, a path 
starting at v without repetition of vertices^] 

Combinatory Reduction Systems. Many of the formalisations we introduce are 



based on the framework of Combinatory Reduction Systems (CRSs) (Klop, 1980), (Klop 
et al., 1993) (Terese, 2003, Sec. 11.3), and, in particular, on infinitary Combinatory 



Reduction Systems (iCRSs) (Ketema and Simonsen, 2011). 

Infinite CRSs. When speaking of 'infinite terms' for CRSs over some signature we 
draw on (Terese, 2003 12.4) and (Ketema and Simonsen, 2011) where meta-terms of 
iCRSs are defined by means of metric completion. The metric is defined on a-equivalence 
classes of finite preterms dependent on the minimal depth at which two finite preterms 
belonging to the equivalence classes have a 'conflict'. The objects formed by the metric 
completion process can be represented as equivalence classes of infinite preterms, we call 
them iCRS-preterms, with respect to a notion of a-equivalence that again is based on 
the notion of 'conflict' (see also (Terese, 2003 Def. 12.4.1)). Hereby iCRS-preterms are 
infinite ordered dyadic trees in which each node is either labelled by a variable name, 
and then the node does not have a successor, or by named abstractions Aa; (with some 



This formulation corresponds to the following original formulation by Denes Konig on page 80 in 
(Konig, 2001): "Satz 3: Jeder unendliche zusammenhangende Graph G endlichen Grades besitzt einen 
einseitig unendlichen Weg, wobei der Anfangspunkt Po dieses Weges beliebig vorgeschrieben werden 
kann." This in connection with the definition on page 10 in fiKonig, 200l|: "Eine unendliche Menge 
von Kanten PiPi + ± (£=0,1,2,... in inf.), bzw. der durch sie gebildete Graph, heifit ein einseitig 
unendlicher Weg, falls fur i + j stets Pi # Pj ist." 
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Fig. 2. Schroer-style proof system v4°°(E) for a-equivalence of iCRS-preterms over 
signature E: for every / e E with arity n, ^4°°(E) contains a rule /. In instances of 
the rule [] , the constant c is chosen fresh for s and t. Substitution which occurs in 
the assumption of [ ] denotes substitution by variable replacement on 
iCRS-preterms. It needs not to be capture-avoiding because of the freshness of the 
substituant. 



variable name x), and then the node has a single successor node, or by an abstraction 
symbol, and then the node has a right and a left successor node. 

For denoting infinite preterms (and later terms) we use, as much as possible, usual 
notation for dealing with finite terms. A slight exception is our use, in certain situations, 
of a finite-CRS-based notation for infinite A-terms that are not A| e trec-expressible (e.g. see 



Example 4.25 ) 



By iCRS-terms we will mean a-equivalence classes of iCRS-preterms. The notion of 
a-equivalence on iCRS-preterms based on the absence of conflicts can be described by 
provability in the pr oof system A°°("E) in Figure [2] wh ich is a variant of a proof system 



due to Schroer (see (Hendriks and van Oostrom, 20031) 



Definition 2.2 (a-equivalence for iCRS preterms, Schroer-style proof system). 

The proof system j4°°(E) for a-equivalence on iCRS-preterms over signature E consists of 
the axioms and rules displayed in Figure [2j each of which contains a rule / for every / e E. 
Provability in yl°°(E) of an equation between preterms is defined as the existence of a 
possibly infinite, completed derivation: for example, by s = t we mean the existence 

of a possibly infinite prooftree T>°° with conclusion s = t such that maximal threads from 
the conclusion upwards either have length u>, or have finite length and end at a leaf that 
carries an axiom 0. (We will generally use the decorated turnstyle symbol to indicate 
provability by completed, possibly infinite derivations.) 

However, closer to coinductive proof systems for infinite A-terms that we develop 
is the following different, but equivalent characterisation of a-equivalence for infinite 
iCRS-preterms, a variant for iCRS-terms of a proof system for a-equivalence between 



finite A-terms due to Kahrs (see (Hendriks and van Oostrom, 2003)). 



Definition 2.3 (a-equivalence for iCRS preterms, Kahrs-style proof system). 

The proof system A°°(E) for a-equivalence on iCRS-preterms over signature E consists 
of the axioms and the rules in Figure [3] with, for every / e E, a rule /. Provability of an 
equation between preterms in A°°(E) is defined, analogously as in[2j as the existence of a 
possibly infinite, completed derivation. 

This formulation of a-equivalence for infinite A-terms will be the key to our formulation 
of a 'coinduction principle' for infinite A-terms in Theorem |9.7| 

Infinite rewrite relation. For an iCRS with rewrite relation we denote by ->» 
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— {%} s - {z}t (if y does not occur in s, 

{xy}y={zu}u {xy}s = {zw}t an d w does not occur in t) 

{xy}s = {zu}t {x} si = {y}ti {x} s n = {y} t„ ^ 

{x}[y]s = {z}[u]t {x}f(s 1 ,...,s„) = {y}f(ti,...,t„) 

Fig. 3. Kahrs-style proof system A°°(S) for a-equivalence on iCRS-preterms over 
signature E: for every / sS with arity n, A°°(E) contains a rule /. 



the infinitary rewrite relation induced by strongly convergent and continuous rewrite 
sequences of arbitrary (countable) ordinal length. Hereby strong convergence means that, 
at every limit ordinal, the rewrite activity in the terms of the rewrite sequence tends to 
infinity. Continuity means that the terms of the rewrite sequence converge, in the metric 
space of infinite terms, at every limit ordinal. By -* u we will denote the rewrite relation 
induced by strongly continuous -^-rewrite sequences of length uj. 

Labelled transition systems, labelled transition graphs. A labelled transition 
system (LTS) is a triple C = (S, A, «•) consisting of a set S of states, a set A of labels, and 
a set »£5xixSof A-labelled transitions. Labelled transitions (s\, a, s 2 ) e S x A x S 
will be indicated as s 1 -5- s 2 - 

A labelled transition graph (LTG) G is a pointed LTS, that is, G = (S, A, x, >-*■) where 
(S, A, >*) is an LTS, and x € S, which is called the initial state. 

Bisimulation between LTSs, LTGs. Let C\ = (Si, A, -^i) and £ 2 = (S2, A, -+2) be 
a LTSs over a common set of labels. A bisimulation on £ is a binary relation R £ Si x S 2 
that satisfies, for all s e Si and t e S 2 : 

(i) if s R t and s -S-i s', then there exists t' e S 2 such that t -5- 2 t' and s' R t'; 

(ii) if s R t and t -i 2 t' , then there exists s' e Si such that s s' and s' Rt' . 

Two states s e Si and t e S 2 are bisimilar, denoted by s ±± t, if there exists a bisimulation 
i? such that s Rt. 

Two LTGs Gi = (S\,A,x\,-*\) and Gi = (S 2 , A, x 2 , -*- 2 ) are bisimilar if there is a 
bisimulation on the underlying LTSs that relates the initial state x\ of Gi with the initial 
state x 2 of G 2 . 



3. The A-calculus and the A| etrec -calculus 

This section provides the definitions for terms in the A-calculus and the A| etr ec-calculus 
that we will be using througout this work. We define CRS signatures and a rewriting 
system R v for unfolding A| etrec -tcrms to obtain infinite A-terms. 

Definition 3.1 (first-order representation of A and A| etrec ). Let X = {x ,xi,x 2 , . . .} 
be a set of variable names for A-abstractions, and 1Z be a set of names for recursion 
variables. We will use x, y, z as syntactical variables for variable names bound by A- 
abstraction, and /, g, h for recursion variable names; and similarly, we will use L, P, Q 
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for terms. The set of A| e trec-terms is inductively defined by the following grammar: 



(term) L ::= Xx.L (abstraction) 

LL ( application) 

x ( variable) 

letrec _B in L (letrec) 

( binding group) B ::= fi= L ... f„ = L ( equations) 



(fi, . . . , f n e TZ all distinct) 

The set of A-terms is defined by a reduced form of the grammar with the letrec alternative 
and the binding group rule left out. 

On this grammar the following rules describe unfolding of A| etre c-terms in an informal 



manner: 






(£) 


letrec B in L Lj 


-> (letrec B in L ) (letrec B in Z^) 


(f A v) 


letrec B in Xx.Lq 


-> Ax. letrec B in Lo 


^letrec) 


letrec Bq in letrec £?i in L 


-> letrec Bq, B\ in L 


(^v CC ) 


letrec -B in fi 


-* letrec Bin Li (if S is /i = Li . . . /„ = L„ 


(f r v Cd ) 


letrec /i = Li .../„ = L„ in L 

(if 


-> letrec = Ljj . . . /j , = Lj- , in L 

are the recursion variables reachable from L) 




letrec in L 


L 



The names of the first four rules are chosen to reflect the kind of term that resides 
inside of the in-part of the letrec-term, which helps to see that the rules are complete in 
the sense that every term of the form letrec B in L is a redex. 

We will use higher-order notation and rules to reason about the A-calculus and the 
A-calculus with letrec, which immediately validates our results for a-equivalence classes 
instead of just preterms. 



Remark 3.2 (infinitary rewriting). We use CRSs as a rewriting framework since 



until now infinitary rewriting theory has only been developed for CRSs yet 


Ketema and 


Simonsen, 2009 


Ketema and Simonsen, 2010 


Ketema and Simonsen, 2011 


• 



For formulating the above rules as a CRS, we provide CRS-signatures for A and Ai e ti 



Definition 3.3 (CRS signatures for A and A| e trec)- The CRS-signature for A con- 
sists of the set T,\ = {app, abs} where app is a unary and abs a binary function sym- 
bol. The CRS-signature SA tetrec consists of the countably infinite set SA tetrec = u 
{let„, rec-in„ | n e N} of function symbols, where, for n € N, the symbols let„ and rec-in„ 
have arity n. 

By Ter(A) and Ter(A| etrec ) we denote the set of closed CRS-terms over and £,\| etr< , c 
respectively, with the restriction that 

— the symbols let„ and rec-in„ only occur as patterns of the form 

let„([/i . . . f n ] rec-in„(Mi, . . . , M n ,M)) for some terms Mi, . . . , M n , M e Ter(X) 

— and that otherwise a CRS abstraction can only occur directly beneath an abs-symbol. 
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Ter(A) and Ter(A| etrec ) will later be specified more formally in Definition 6.f and Defini- 
tion E3 

We will use M, N, O used as syntactical variables for terms in Ter(X). And by 
Ter(A| etre c) we denote the set of CRS-terms over Ea, for which we will use the symbols 
L, P, Q as syntactical variables. 



Example 3.4. The term in Example |1.1| in CRS notation: 
abs([a] abs([6] app(abs([c] app(abs([d] app(. . . , c)), &)), a))) 

Example 3.5. The terms in Example 1 1. 2 1 in CRS notation: 
abs([x]leti([r] rec-ini(app(app(:, x), r))),r) 
abs([a;]leti([r] rec-ini(app(app(:, x), app(app(:, x), r)),r))) 



Definition 3.6 (terms in A°°). We denote by Ter((A°°)) the set of finite and infinite 
CRS-terms over the signature Note that the set of infinite A-terms subsume finite 
A-terms, thus whenever we speak of an infinite A-term in fact we refer to a potentially 
infinite A-term. 



Definition 3.7 (CRS for unfolding in A| e trec)- The CRS R v for unfolding \i etrec -terms 
is the CRS for terms over the signature EA letrec (see Definition 3.3 1 with the following rule 
schemes in which n varies among numbers in N + : 

(q®): let„([/]rec-in„(X 1 (/),...,A:„(/) ! app(Zo(/),Z 1 (/)))) 

^app((let n ([/]rec-in„(...,X n (/),Z (/)))), (let n ([/]rec-in n (...,X n (/),^ 1 (/))))) 

(4) : let„([/] rec-in n (X 1 (/) ) . . . ,X n (f),abs([x]Z(f,x)))) 

-+ abs([.x] let„([/] rec-m^X^f), . . .,X n (f),Z(f, x)))) 

(g l ; tTec ) : let n ([/]rec-in n (X 1 (/) ) . . . ,*„(/), let m ([p] rec-\n m (Yi(f,g), ■ ■ ■ ,Y m (f,g)Z(f,g))))) 
- let n+m ([/5] rec-in n+m (X 1 (/), . . . , X„(/), Y 1 (/,g), . . . ,Y m (f,g),Z(f,g))) 

(QT) : let n ([/]rec-in n (X 1 (/),...,X„(/),/ i )) - let n ([/]rec-in„(X 1 (/),...,A'„(/),X i (/))) 
: let (rec-in (Z)) - Z 

: let„([/ x .../„] rec-in„(X 1 (/ ilil , ..../,. ) X n (fn, n fi mn ,J,) 

^(/il,n+l > • • • ) fim n+1 ,n + l ) ) 

-* \et n >{[f h . . . f Jn ,]rec-m n \X J1 {L l ^ 1 ,. . .,L mjidl ), . . .) 

The formulation here of the rule scheme p™ d for reducing the binding group of a 
letrec-binding by removing unreachable equations needs additional explanation. On the 
right-hand side, the terms denoted by the symbol L with different indices are defined as 
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letrec / = letrec g = f g in g in / 

►v-rec letrec / = letrec g = f g in g in letrec g = f g in g' 

f = letrec q = f q in g . , 
>v.ictrcc letrec J , y y in g' 

=fg 

/ = letrec g = /g in p . , 

► v . rec letrec , , in/ 3 

ff =fg 

/ = letrec g = fg in . ., „ „. „. , 

► v . rec letrec J , y y y in (letrec g" = /g" in 3") g' 

9 =fg 

/, / = letrec g = /g in g „ „ . „ Wl / = letrec g = / g in g . ,-. 

► v .e (letrec y J y y in letrec g" = /g" in g") (letrec J , y y y in g') 

g = fg g = fg 

f = letrec g = f g in g 
/1 ? r r ■ n\ 1, f = letrec g = f g in g . A 

^v.ictrcc (letrec g' = /g' in g") (letrec , y Jy in g'j 

f n g = fg 

g =fg 



Fig. 4. Unbounded growth of the binding group in a -> v -rewrite sequence on a 
Aietrec-term that does not contain -> v . rc d-stcps. 



follows: 

T I fiy.x ^ 6 Vlj ■ ■ ■ i3n'} , r „ , . , i j 1 ^ ^ \ 

Lyj^. := -! (for all 1 < a; < n and 1 < y < rrij x ), 

I Xw.w else 

relative to the indices ji, . . . , j n > with 1 < ii < . . . < j„' ^ n that are precisely the indices 
that are reachable from the indices in {ii n+ i, . . . ,i mn+1 .n+i} (the indices I of /; that are 
applied to the variable X) via the binary reachability relation «■ on {l,...,n} that is 
defined, for all x,y e {1, . . . , n}, by: x y if and only if there exist a z e {1, ... , m^} with 
«z,:z = 2/, formally that is: {j u . . . , j„,} = {l\i»* I for some i e {ii >n+ i, . . . , i m „ +1 , n +i}}- 

The ARS induced by the CRS -R v will be denoted by R v . And by ^- v .<a , ->- v .a ; ~^v-nii > 
"^v-xec ) ^v ictroc j and ^-y.rcd we denote the rewrite relations of both R v and R v that are 
induced by the rules g® , g^, gY; , g™ c , g l ° trcc , and p™ d , respectively. 

Remark 3.8 (motivation of the rules g™ d and g™ 11 in .R v ). The purpose of taking up 
the rule p™ d , together with the rule g™ 1 into the CRS R v consists in preventing unbounded 
growth of binding groups during unfolding. Consider for instance the outermost rewrite 
sequence on the term letrec / = letrec g = f g in g in / shown in Figure [4] Applications of 
the rule p™ d are able to remove unreachable equations in binding groups. 

While restricting the size of binding groups during unfolding is a sensible constraint on 
the unfolding process, it is not strictly necessary to define the unfolding of a A| e trec-term. 
We could also use a rule g^ ee in place of g™ d and g 1 ™ 1 where g ™ e is (informally) defined 
as: 

(g% cc ) ■■ letrec /i = Zi . . . f n = L n in L ->■ L (if fx, . . . ,/„ do not occur in L) 

which allows steps like Ax. letrec B in x -»■ Ax. a;, and thus allows to move bound variables 
out of the in-part of letrec-expressions. (Note that in i? v such a step can be simulated by 
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-►v-red-step followed by a -^ v . n ii-step.) We will, however, embed the unfolding rules into 
other rewriting systems of which we wish to perform unfolding in a lazy way such that 
the number of derivable terms is bounded, the approach with the p^ co -rule runs counter 
to that idea, as it easily leads to an unbounded growth of binding groups. 

Remark 3.9 (shape of the rule g™ d in i? v ). The rewrite rules g™ d of R v are not 

'fully extended' as the metavariables Xi occuring in the left-hand side of the rule do not 
have to be instantiated with all recursion variables /i , . . . , /„ bound in the abstraction 
prefix. This is due to the design of this rule scheme in which reachability of a recursion 
variable /, from the in-part of the formalised letrec-term is defined by extracting from the 
format of the specific instance which recursion variables f±, . . . , /„ occur in which of the 
metavariables X\, . . . , X n . 

In applications of a rule f?y d , unreachable recursion variables are removed from the 
abstraction prefix on the right hand side. Since the format of CRSs requires that corre- 
sponding metavariables on the left- and on the right-hand side of a rule must have the 
same arity, occurrences of unreachable recursion variables as arguments for metavariables 
describing the binding group of a reachable recursion variable cannot simply disappear 
on the right-hand sind. In the definition above, a specific, but arbitrarily chosen A-term, 
namely Xw.w, is substituted for such occurrences of unreachable recursion variables. This 
does not interfere with the unfolding operation defined later. 

Furthermore we profit from the property of normal forms w.r.t. g r ° d that the set of free 
variables of the A| e trec-term corresponds to the set of free variables of its unfolding, which 
we will utilise in the mention rewriting systems later on. 

Definition 3.10 (reduced A| e t r ec-terms). A A| e trec-tcrm L is called reduced if it is a 
normal form with respect to -> v . n ii and -^ v . ro d- If a A| e t re c-terms P reduces to a reduced 
term L by exclusively -» v . n ii- and -^ v . ro d-steps, then P is called a reduced form of L. 

Proposition 3.11 (confluence of letrec-unfolding). R v is confluent. 



Proof. We give a proof based on decreasing diagrams (Terese, 2003 Sec. 14.2) by 
showing that parallel steps are confluent. The proof involves a comprehensive critical-pair 
analysis. It can be found in Appendix [A] on page[69| □ 

Example 3.12. R v when applied to letrec / = Xxy.fyx in / admits the following rewrite 
sequence: 

letrec / = Xxy.fyx in / 
-►v.rec letrec / = Xxy.fyxm Xxy.fyx 
Ax. letrec / = Xxy.fyx in Xy.fyx 
^v.a Xxy. \etrec f = Xxy.fyxm fyx 

-* v .<a Xxy. (letrec / = Xxy.fyx in f y) (letrec / = Xxy.fyx in x) 
-►v-red Xxy. (letrec / = Xxy. f y x in / y) (letrec in a) 
-►vnii Axy. (letrec / = Xxy. fyx\nfy)x 
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-> v .@ Axy. (letrec / = Xxy.fyx in /) (letrec / = Xxy.fyx in y) x 
^v.rcd Axy. (letrec/ = Xxy.fyx in /) (letrec iny)a; 
->■ v.nii Arry. (letrec / = Azy. fyx\nf)yx 
-►v-xec Axy. ... I/I 

However, not every A| etrec -terms unfolds to an infinite A-term in the sense that it has 
an infinite A-term as its infinite -» v -normal form. For example, the A| e trec-term L = 
letrec / = / in / admits only rewrite sequences of the form L -^ V . ICC L -*- v . rec ■ • ■, and hence 
does not unfold to an infinite A-term. Terms like this are unproductive in the sense that 
during all outermost-fair rewrite sequences the production of an infinite A-term stagnates 
due to an unproductive cycle. 

Definition 3.13 (i£ v -productivity). Let L be a A] etr ec-term L. We say that L is 
Reproductive if the following statement holds: 

— L does not have a -» v -reduct that is the source of an infinite -> v -rewrite sequence 
consisting exclusively of outermost steps with respect to -^ v . rC c, ~^v-nib ^v-ictrec, or 

~~ *■ V-red • 

Lemma 3.14. Let L be a A| e trec-term of the form letrec B in P. Then exactly one of the 
following statements hold: 

— All maximal outermost-fair _R v -rewriting sequences on L solely contain terms of the 
form letrec C in Q. 

— All maximal outermost-fair i? v -rewriting sequences on L only have finitely many 
terms of the form letrec C in Q. 

Alternatively: 

Then the following statements holds either for all maximal outermost-fair iJy-rewriting 
sequences or for none: the sequence contains only terms of the form letrec C in Q. 

Lemma 3.15. For all A| e trec-terms L the following statements are equivalent: 

(i) L M for some infinite A-term M. 

(ii) L is .Reproductive. 

(iii) Every maximal outermost-fair -» v -rewrite sequence on L is strongly convergent. 

Proof, (ii) => (iii), because if L is Reproductive then every outermost occurrence 
of a letrec in every -> v -reduct will be eventually pushed down to a higher position by 
either a -» v .a- or a -> v @-step of any maximal outermost-fair -> v -sequence. Since only 
letrec-terms are redexes in i? v any maximal outermost-fair rewrite sequence starting from 
L converges to an infinite normal form, (i) follows directly from (iii). (i) =>■ (ii) follows 



from Lemma 3.14 by contradiction. If L is not _R v -P roc hi c tive then it has by definition a 



-reduct with at least one occurrence of a letrecwhich cannot be pushed further down 



by any outermost application of any i? v - rme - By Lemma 3.14 the same holds for every 
other maximal outermost-fair rewrite sequence. Therefore L cannot unfold to an infinite 
A-term M because M may not contain any letrecs. □ 
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Lemma 3.16 (uniqueness of unfolding). Unfolding normal forms of A] etr ec-terms 
reachable in at most lu steps are unique. That is: if Mi L M 2 for a A| e trec-term L, 
and infinite A-terms Mi and M 2 , then Mi = M2. 

Proof sketch Let us assume that L unfolds to Mi and M 2 by the -* v -reduction sequences 
Ti and r%. Consider for n e N the first reduct Pi (P 2 ) in the sequence t\ (t 2 ) that is stable 



above depth n. It follows from confluence of the rewrite relation (Proposition 3.11 ) that 
Pi and P 2 have a common ^ v -reduct P 3 . Since the rules of R v do not create a redex at 
lower depth than the occurrence of the left-hand side ('redexes are not pushed upwards') 
in the rewriting sequence from Pi (P2) to P3, no contractions take place above depth n. 
In that sense Pi and P2 are 'equal up to depth n\ Such a notion is, however, still in need 
of precise formulation for CRS-terms. The argument can be repeated for arbitrary n 6 N, 
therefore Mi and M 2 agree on arbitrarily large outermost contexts. □ 

As a consequence of the lemma above, the rewriting system _R V defines a partial function 
IA for unfolding A| etrec -terms. 

Definition 3.17 (unfolding as a mapping). We define the partial unfolding function: 

U ■■ Ter( A| etrec ) - Ter(\°°) 

L ^ M if L -+% M 



We say that L expresses M if U(L) = M. Uniqueness of IA follows from Lemma 3.15 



Example 3.18. The terms from Example |1.2| both express the same A°°-term: 
A.t. letrec r in x : r ->» v Xx.x : x : x : . . . ««- v Ax. letrec r in x ■ (x : r) 

Example 3.19 (i? v -unproductive A| e trec-term). As an example for a non-unfoldable 
Aietrec-term, consider letrec / = letrec g = f in g in / which is not in dom(U) and the cyclic 
rewriting sequence: 





letrec f = 


letrec g = 


f in !>; in f 


V-rec 


letrec f = 


letrec g = 


f in g in letrec g = f in 


V-lctrcc 


letrec f = 


letrec g = 


fing; g = f ing 


V-rec 


letrec f = 


letrec g = 


fing; g = f inf 


V-red 


letrec f = 


letrec g 


f in g in f 



We will revisit this example in Example |6.21| to illustrate the cyclicity proof. 

We also define an unfolding function which is complete on Ter (A| e trec) by mapping 
non-unfoldable subterms to 1, which yields Bohm trees in A°°. 

Definition 3.20 (partial unfolding). Ter ± (A°°) denotes infinite terms over u {l}, 
where 1 is a constant symbol. 

U' ■■ Ter(A| etrec ) -> Ter ± (A°°) 

L«M if L ^ M 

Thereby ^ s the infinitary rewrite relation induced by the rewrite relation ->- ± that 
extends -» v by mapping .Ry-root-active subterms to 1. 
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4. Regular and strongly regular terms in A°° 

For infinite first-order trees the concept of regularity is well-known and well-studied 
(Courcelle, 19831. Regularity of a labelled tresis defined as the existence of only finitely 



many subtrees and implies the existence of a finite graph that unfolds to that tree. In 
this section we generalise the notion of regularity to trees with a binding mechanism, the 
A°°-calculus specifically. We give a definition for regularity which corresponds to regularity 
of a term when regarded as a first-order tree, and for strong regularity, which will be 
shown in the following sections to coincide with A| etrec -expressibility. 

We define regularity and strong regularity in terms of rewriting systems that will 
be called Reg and Reg*. Rewrite sequences in these systems inspect a given term 
coinductively in the sense that a rewrite sequence corresponds to a decomposition of the 
term along one of its paths from the root. Both Reg and Reg + extend a kernel system 
Reg' comprising three rewrite rules which denote whether the position just passed in the 
tree is an abstraction or an application and in the second case whether the application is 
being followed to the left or to the right. 

The rewriting systems are defined on A°°-terms enriched by what we call an abstraction 
prefix, by which the terms can be kept closed during the deconstruction. This is crucial 
for the definition of the rewriting system as a CRS. While intuitively it is clear that the 
A-term M N is composed of the subterms M and N, abstractions are more problematic. 
In a first-order setting one could say that Xx.M contains M as a subterm, but if x occurs 
freely in M then M would be an open term. That means that the scrutinisation of an 
abstraction would be able to go from a closed term to an open term, which would run 
counter to the interpretation of a higher-order term as an a-equivalence class. In the 
definition of the rewriting systems below this issue is resolved as follows. When inspecting 
an abstraction, the binder is not left out but moved from the scrutinised subterm into 
the prefix. That guarantees that the term as a whole remains closed. 

In Xx.Xy.xxy for instance the path from the root to the second occurrence of x then 
corresponds to the rewrite sequence: 

QXx.Xy.xxy -*\ (Xx)Xy.xxy -*\ (Xxy)xxy -»-@ (Xxy)xx -*® t (Xxy)x 

The Reg and the Reg system extend Reg' by a scope- delimiting rule, which signifies 
that the scope of an abstraction has ended, whereby both systems are based on different 
notions of scope. 

Reg relies on what we simply call scope of an abstraction: the range from the abstraction 
up to the positions under which the bound variable does not occur anymore. 

We base Reg + on a different notion of scope, called scope + , which is strictly nested. 
The scope + s of an abstraction extends its scope by encompassing all scope + s that are 
opened within its range. As a consequence, scope + s do no overlap (see Figure pi). 



Precise definitions of scope and scope + are given later in Definition 7.2 



* By a 'labelled tree' we here mean a finite or infinite tree whose nodes are labelled by function symbols 
from a first-order signature such that the arity of the function symbol in a node determines the number 
of successors of the node. 
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Fig. 5. The difference between scope and scope + 



When every scope + is closed by the scope + -delimiting rule then the sequence of rewrite 
steps alone (i.e. without the terms themselves) unambiguously determines which abstrac- 
tion a variable occurrence belongs to. The rewrite sequence from above would then have 
one additional scope + -delimiting step asserting that the variable at the end of the path is 
indeed x and not y: 

QXx.Xy.x xy -*\ (Xx)Xy.xxy -*\ (Xxy)xxy -»@ (Xxy)xx (Xxy)x ->s (Xx)x 

The abstraction prefix not only keeps the term closed but also denotes which scope + is 
still open, which provides the information to decide applicability of the scope + -delimiting 
rule. The last step closes the scope + of y, therefore that variable is removed from the 
prefix. The rewrite sequence for the path to the occurrence of y does not include an 
scope + -delimiting step: 

QXx.Xy.xxy — >• a (Xx)Xy.xxy —>\ (Xxy)xxy ->■©, (Xxy)y 

Ultimately, the Reg + rewriting system defines nameless representations for A°°-terms 
related to the de-Bruijn notation. Considering the de-Bruijn representation of the above 
term AA(5'0)(S'0)0 we find that the position of the -^s-steps indeed coincides with the 
position of the S markers. However, the rewrite system Reg* permits more flexibility for 
the placement of -^s-steps, an idea also used in (van Oostrom et al., 2004). For example 



the path from above to the second occurrence of x can also be witnessed by another 
rewrite sequence 

() Xx.Xy.xxy -*x (Xx)Xy.xxy ->\ (Xxy)xxy -^-@ (Xxy)xx -+s (Xx)xx -►©j (Xx)x 

where the scope + of y is closed earlier. This would correspond to A A (5 (00)) in de- 
Bruijn notation, more precisely, in a variant of the de-Bruijn notation which permits 
the scope/scope + -delimiter S to be used also within the term, before abstractions and 
applications. 



Definition 4.1 (CRS-terms with abstraction prefixes). The CRS-signature for 
(A), the A-calculus with abstraction prefixes, extends the CRS-signature for A (see 
3.3) and consists of the set £(a) = £a u {P re n I n 6 N} of function symbols, 



Definition 



where for n e N the function symbols pre„ for "prefix X- abstractions of length n are unary 
(have arity one). Using the syntactical variables for terms in A, CRS-terms with leading 
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prefixes pre„([a;i] . . . [x„]M) will informally be denoted by (Aa;i . . .x n )M, abbreviated 
as (Xx)M. 

Definition 4.2 (The CRSs Reg', Reg, Reg* for decomposing A-terms). Consider 
the following CRS-rules over signature ^(\)' 



KQ ) 



pre n ([x 1 . . . x n ]app(Z Q (x), Z^x))) pre„([x 1 ...x n ]Zi(x)) (i e {0,1}) 
pre„([xi . . . x n ]abs([x n+1 ] Z(i))) -» pre„([xi . ..x n+1 ]Z(x)) 
pre„ +1 ([xi . . . x n+ i] Z(xi,. . . , x n )) -* pre„(|>i . . . x n ] Z(xx,. . . , x n )) 



/ dcl\ P re ri+l( l x l ■ ■ ■ x n+lj Z(xi , . . . , Xi-i , Xi+\ , ■ . ■ , X n +\) ) -» 

(g ) : 

P r ^ra ( [•'"1 ■ • ■ x i~\ x i+l ■ ■ ■ X 7l\ Z(x\ , . . . , Xi-i , Xi+\, . . . , X n+ \ ) ) 

By Reg' we denote the CRS with rules g ; and g x . By Reg (and respectively, by Reg*) 
we denote the CRS consisting of all of the above rules except the rule g 5 (except the 
rule g del ). The rewrite relations of Reg', Reg, and Reg* are denoted by -*- re g-j -^reg and 
^rcg+, respectively. And by -^<a , ->-\, -^s, -+deh we respectively denote the rewrite 

relations induced by each of the single rules g®° , g® 1 , g x , g s , and g del . 

Assuming that the translation between the formal and the informal notation is facile, 
for better readability we will from now on rely on the latter. Here are the rules from 
above in informal notation: 



KQ ) 

(e s ) 
(p dcl ) 



(Xx 1 ...x n )M M 1 -> (Ax 1 ...x„)M i (*6{0,1}) 
(Xxi . ..x n )Xx n+1 .M -> (Xxi . ..x n+1 )M 

(Xxi . . . x n+ i)Mo -»■ (Axi . . . x n )Mo (if the binding Xx n+ i is vacuous) 
(Xxi . ..x n+ i)M -> (Xxx . ..Xi-iXi+i . ..x n+1 )M 

(if the binding Xxt is vacuous) 



Remark 4.3 (only Reg* defines nameless representations). Considering the graphs 
from Figure [fj] without labels on their nodes we see that only from the Reg* graph the 
original term could be reconstructed unambiguously. For example, the path to the 
rightmost occurrence of x has the rewrite sequence in Reg 



which would also be an admissible to witness an occurrence of y at the same position. 
This ambiguity plays a role for the definition of A-transition graphs in Section [9j and is 
discussed in that context in Remark 19*21 



Note that the following relationships between rewrite relations: ->>eg- is contained in 
► reg and ^rcg+j and since the rule g del generalises the rule g s , -*- r eg+ is contained in -*- re g. 



Proposition 4.4. Every rewrite sequence in Reg* corresponds directly to a rewrite 
sequence in Reg by exchanging ->s-steps with del-steps. 

Our interest will focus on the subset of terms with an outermost abstraction prefix 
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Fig. 6. The sub-ARSs induced by ()Xx.Xy.xxy. Note, that in the pictures the 
nodes do not display the entire (A°°)-ternis but only their prefixes. 

symbol and no other occurrences of such symbols. Note that the rules in Reg and Reg + 
guarantee that every reduct of a term of the form (Xx)M is again a term of this form. 
Therefore we dchne: 

Definition 4.5 (prefixed A°°-terms). By Ter((A°°)) we denote the set of closed 
iCRS-terms over £(a) with the restriction that 

— every term M e Ter((A°°)) has a prefix at its root and nowhere else, or in other words: 
M is of the form pre n ([a;i] ■ • ■ [%n] M) where M does not contain any occurrences of 
function symbols pre 4 for i e N 

— and that otherwise a CRS abstraction can only occur directly beneath an abs-symbol. 



Ter((A°°)) is more formally specified in Definition 6.2 



Proposition 4.6. Ter((A°°)) is closed under ^ r cg- ; ^rcg, and ->-reg + - 

Definition 4.7 (the ARSs Reg', Reg, Reg + ). We denote by Reg' , Reg and Reg + the 
infinite abstract rewriting systems (ARSs) induced by the iCRSs derived from Reg' , Reg, 
Reg + , restricted to terms in Ter((A°°)). 

The rewrite relations of Reg' , Reg and Reg + will be denoted by the same symbols used 
for Reg' , Reg and Reg + . Since all of our considerations will refer to the restricted set of 
terms, this should create no confusion. 

Proposition 4.8. The restrictions of the rewrite relations as defined in Definition |4. 21 to 
Ter((A°°)), the set of objects of Reg' , Reg, and Reg + , have the following propertiesPl 



Mind the restriction here to terms in Ter(A°°). 
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(i) ^dci is confluent, and terminating. (Xx)x is the only term in -»• del-normal form. 

(ii) ^dci one-step commutes with -*-@ , and one-step sub-commutes with -»s : 

■^dcl • -*X — ~*\ ' "^dcl *~de\ ' - -+@ t ' "^del (* 6 {0, 1}) 

^-del ' - ' ""del 

(iii) - -+deh and consequently, -*- reg + £ ^rcg- Furthermore, -*- re g- % ^rcg+ % -^reg ■ 

(iv) ->s is deterministic, hence confluent, and terminating. Every term in -*s-normal form 
is of the form (Xx\ . . . x n )x n . 

(v) one-step commutes with -+\, -^<a , and 



(Z6{0,1}) 



(Xx)x is the sole term in -» rGg -normal form. Every -> rog + -normal form is of the form 
(Xxl. 

(vi) ->-rog and ->- r cg+ are finitely branching, and, on finite terms, terminating. 

Proof. Most properties, including those concerning commutation of steps, are easy to 
verify by analysing the behaviour of the rewrite rules in Reg on terms of Ter((A)). 

For -^ rcg + ^ ^-rog in ([hi]) note that, for example, (Xxy)y -^ rcg (Xy)y by a ^> del-step, but 
that (Xxy)y is a -^s-normal form, and hence also a -^ rcg +-normal form. 

In item (|vi| we first argue for finite branchingness of -^ rcg and ^- r cg+ on Ter(A°°): this 
property is entailed by the fact that, on a term (Xx)M with just one abstraction in its 
prefix, of the constituent rewrite relations -»-@ , -»-<a 1 , -»-s> ^dei of -^ rog and ^■ rog + 
only ->dci can have branching degree greater than one, which in this case then also is 
bounded by the length |i| of the abstraction prefix. For termination of -^rcg and -> r cg+ 
on finite terms with just a leading abstraction prefix we can restrict to ->-rcg? due to (jmj), 
and argue as follows: On finite terms in Ter((A°°)), in every -> rcg -rewrite step either the 
size of the body of the term decreases strictly, or the size of the body stays the same, 
but the length of the prefix decreases by one. Hence in every rewrite step the measure 
(body size, prefix length) on terms decreases strictly in the (well-founded) lexicographic 
ordering on N x N. □ 



As a consequence of Proposition 4.8 |i|) and |iyj, the rewrite relations -^dci and ->s are 
normalizing on Ter(A°°). For every term M e Ter((A°°)) we will denote by Mjdei and 
Alls the normal forms of M with respect to -*dei and ->S) respectively. And by -* d ei (by 
-» s ) we denote the many-step rewrite relation for ->-del (f° r ~*s) that leads to a ^-del-normal 
form (to a -^s-normal form), that is: M -» dcl N if Mjdel = N (and respectively, M -»- s N 
if Mis = N), for all terms M and N. 

Proposition 4.9. The following statements hold: 

(i) Let (Xx)M a term in Reg with \x\ = n e N. Then the number of terms (Xy)N in Reg 
with (Xy)N -» de i (Ax) A/ and \y\ = n + k e N is 

(ii) Let T be a finite set of terms in Reg, and I e N. Then also the set of terms in Reg that 
are the form (Xy)N with \y\ < k and that have a -» d ei-reduct in T is finite. 

Proof. If (Xy)M(y) = (Xy 1 . . . y n+k ) N(yi, . . . , y n+k ) -» de i (Xx t . . . x n )M(x 1 , . . . , x n ) = 
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(Xx)M(x), then it follows that there are i\, . . . , i n e {1, . . . , n+ k} with i\ < ii < . . . < i n 
such that the term (Xy)M(y) is actually of the form (Xyi . . .y n+k )N(y il , . . . ,yi n ) and 
furthermore (Xy^ . . ■yi„)N(yi 1 , . . . ,y;„) = (Xx\ . . . x n )M(xi, . . . , x n ). Hence the num- 
ber of terms (Xy)M(y) with -^dd-reduct (Xx)M(x) is equal to the number of choices 
ii,...,i n e {1, . . . ,n + k} such that i\ < i% < ... < i n . This establishes statement 
Statement ^ is an easy consequence. □ 

Lemma 4.10. On Ter((A°°)), the rewrite relations -^ rog and -^ rcg + have the following 
further properties with respect to -»-del> ^*Sj ~*deV anc ^ ~^s : 

«-del--*reg £ (-"del ' ^rcg- ) ' ^dcl (4.1) «"del • -*Teg+ £ (-+S ' "^reg- ) ' ^dcl (4.3) 
^dcl • ^rog £ (^dcl ' ^rcg-)* ' ^dcl (4.2) ^ d ol • ^rcg+ £ ("4 ' ^rcg-)* " ^dcl (4.4) 

Proof. These commutation properties, which can be viewed as projection properties, can 
be shown by arguments with diagrams using the commutation properties in Proposition |4.8| 

□ 

Remark 4.11. The commutation properties in Proposition |4. 10| can be refined to state 
that ^-A-steps project to -*A-steps, and -*@ - and ^©j-steps project to ->@ - and ^©j-steps, 
accordingly. 

As an immediate consequence of Proposition |4. 10] we obtain the following lemma, which 
formulates a connection via projection between rewrite sequences in Reg (in Reg + ) and 
-^■del-eager (^s-eager) rewrite sequences in Reg (in Reg + ) that do not contain -*-@ 0) 
or -steps on terms that allow del-steps (->s-steps). 

Lemma 4.12. The following statements hold: 

(i) Every (finite or infinite) rewrite sequence in Reg of the form: 

t : (Xx Q )M Q -> Teg (ASi)A/i -^ rcg . . . ->- reg (Xx k )M k -> res . . . 

projects over a rewrite sequence ir : (Xxq)Mq -»dci (Xx' )Mq to a -^del-eager rewrite 
sequence in Reg of the form: 

f : (Xx' )M ^dcl ' ^rog- (A5i)Ml ^dcl ' ^rog- • • • 

• ■ • ~^dcl ' ^rcg- (^'k)Mk -*dcl ' "^reg" ■ ■ ■ 

in the sense that (Aii)Mj -»dei (Ax^)Mj for alii e N less or equal to the length of r. 

(ii) Every (finite or infinite) rewrite sequence in Reg + of the form: 

r : (Xy a )N -> r eg+ (Ayi)iVi -> re g+ • ■ ■ -*-re g + (Xy k )N k -^ rcg+ . . . 

projects over a rewrite sequence ir ; (Xyo)Ng -»dci (Xy' Q )No to a -^s-eager rewrite 
sequence in Reg + : 

f : (Xy )N Q ^5 ' ^rog- (Vl)-^l "4 ' ■ • • 

■ • ■ ~*S ' ^rcg- (^y'k)N k ^dcl ' ^rcg- • • ■ 

in the sense that (Xy^Ni -»s (Ay^)iVj for alH e N less or equal to the length of r. 
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Remark 4.13 (non-determinism of ^ rcg and -^ rog +)- On terms in Ter((A°°)), which 
have just one prefix at the top of the term, there are two different causes for non- 
determinism of the rewrite relations in Reg and Reg + : First, since the left-hand sides 
of the rules g &0 and g® 1 coincide, these rules enable different steps on the same term, 
producing the left- and respectively the right subterm of the application immediately 
below the prefix. Second, the rules g del and g s can be applicable in situations where also 
one of the rules g °, g , or g x is applicable. Whereas the first kind of non-determinism 
is due to the 'observer' having to observe the two different subterms of an application in 
a A-term, the second is due to a freedom of the observer as to when to attest the end of a 
scope (of some kind) in the analysed A-tcrm. 

In the definition below we define strategies on Reg and Reg + that resolves the second 
source of non-determinism while leaving the first kind intact. As a result the sub-ARS 
induced by some term M with respect to Reg + correspond structurally to the term graph 
of M. 

Definition 4.14 (scope/scope*-delimiting strategy). We call a strategy § for Reg 
(for Reg + ) a scope- delimiting strategy (a scope + -delimiting strategy) if the source of a step 
is non-deterministic (that is, it is the source of more than one step) if and only if it is the 
source of precisely a -^@ -step and a -»@ 1 -step. 

For every such strategy S, we denote by ->@ ,s, -+a 1) s, ->a,s j and ->dei,S (-*s,s), the 
rewrite relations that are induced by those steps according to S that are induced by 
applications of the rules g®° , p® 1 , g x , and g dcl (g s ), respectively. 

Remark 4.15. Note the following more verbose formulation of the condition for a 
strategy § for Reg (for Reg + ) to be called a scope-delimiting (scope + -delimiting) strategy: 

— every source of a step is one of three kinds: the source of a -^-step, the source of 
a ^del-step (a -^-step), or the source of both a -*-@ -step and a -►©j-step with the 
restriction that (in all three cases) it is not the source of any other step. 

Mindful of the fact that sources of -^>-steps are not sources of -^-steps, or vice versa, 
in Reg (in Reg + ), this condition can be relaxed to the equivalent formulation: 

— no source of a -^A-step or a -^@ ; -step for i € {0, 1} is also the source of a -^del-step (a 
-►s-step), and every source of a -^@.-step for i € {0, 1} is the source of both a ^@ - 
and a ->■<§! -step, but not the source of any other step. 

Definition 4.16 (eager and lazy scope/scope*-delimiting strategies). The eager 
scope- delimiting strategy § C ag (lazy scope-delimiting strategy Si azy ) for Reg is defined as 
the restriction of rewrite steps in Reg to eager (and respectively, to lazy) application of the 
rule g del : on a term M e Ter((A)), applications of other rules are only allowed if g dcl is 
not applicable (applications of g dcl are only allowed when other rules are not applicable). 
Analogously, the eager scope + -delimiting strategy §£ ag (the lazy scope + -delimiting strategy 
SJazy) for Reg + is defined as the restriction of rewrite steps in Reg + to eager (respectively, 
to lazy) application of the rule g s . 

Remark 4.17 (history-aware versus history-free scope-delimiting strategies). 
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? ? ? ? 




§cag §lazy §eag §lazy 

Fig. 7. sub-ARSs induced by Xx.Xy.xxy with respect to different Reg and Reg + 
strategies; compare: Figure [6| Again note, that the labels do only show the prefixes 
associated with each term. 

The history- free strategy obtained by projection from a history-aware scope-delimiting 
strategy is not in general a scope-delimiting strategy. This is due to the non-determinism 
which may be introduced by the projection. Consider for example the term MM with 
AI = Xx.Xy.xxy and the history-aware strategy S constructed by using S oag on the left 
component and Si azy on the right component of MM. The sub-ARS induced by MM 
then corresponds to the graphs of the sub-ARSs for § cag and §i azy as depicted in Figure [7] 
placed side by side with an additional connecting node at the top (also some leafs are 
merged). The induced sub-ARSs of the history- free strategy obtained by projection, 
however, resembles the graph from Figure [6] (with one additional application node at 
the top). That graph however bears the non-determinism which is not permitted for a 
scope-delimiting strategy, in the form of the existence of a source of both a ->s and a 

-►reg ste P- 

The following proposition formulates a property of the eager scope-delimiting (scope + - 
delimiting) strategy in Reg (in Reg + ) that assigns it a special status: the target of every 
rewrite sequence with respect to § oag (with respect to Sg ag ) can be reached, modulo some 
final ^del-steps (^s~ s t e P s )7 a l so by a rewrite sequence with respect to an arbitrary scope- 
delimiting (scope + -delimiting) strategy. Furthermore, rewrite sequences with respect to 
S oag (with respect to Sg ag ) are able to mimic rewrite sequences with respect to an arbitrary 
scope-delimiting (scope + -delimiting) strategy, up to trailing -^del-steps (^s~steps) applied 
to the latter. 

Proposition 4.18. For all scope-delimiting strategies S on Reg, and scope + -delimiting 
strategies S + on Reg + the following statements hold: 

(i) ~*Seag factors into -»§ and -»dci > and ^sj ag factors into and 
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(1) (2) (3) 

Fig. 8. Induced sub-ARS as graphs (only with prefixes as node labels) for: 

(1) Example 1 1 . 1 1 with the eager S eag scope-delimiting strategy for Reg. The dotted 
line denotes node equality. The two connected nodes represent identical 
(a-equivalent) terms. They are represented by two separate nodes instead of a 
single one to avoid confusion as to which variable (it is a) is deleted from the prefix 
by the adjacent del-edge. 

(2) Example [Li] with an arbitrary scope + -delimiting strategy for Reg + . The vertical 
dots denote an infinite growth of the graph. 

(3) Example |3.18| with an arbitrary scope + -delimiting strategy for Reg + . The 
horizontal dots denote an unspecified continuation of the graph, which cannot be 
made explicit as the nature of the ': '-operator has been left open. 

(ii) "^Scag i s cofinal for with trailing ->dci-steps, and - >, s+ a is cofinal for ^-§+ with 
trailing ->-s-steps. 

Definition 4.19 (generated subterms of A°°-terms). Let S be a scope-delimiting 
strategy for Reg with rewrite relation ->■§ (let S + be a scope-delimiting strategy for Reg + 
with rewrite relation -»■§+). For every M € Ter(A°°), the set STg(M) of generated subterms 
of M with respect to Reg and S (the set ST§(M) with respect to Reg + and §) is defined 
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as the set of -»§-reducts (the set of -»§+-reducts) of ()M via the mappings: 

ST S : Ter(A~) — P(Ter((A""))) ST+ + : Ter(A~) — P(Ter((\°°))) 

M i— ► STg(Af) := (Af-» s ) M i— ► STj + (M) := (M -»s+) 

Definition 4.20. Let A be an abstract rewriting system, and S a strategy for A. We say 
that an object a in .A is S-regular in A if the ARS (a -»§) induced by a (and consequently 
the set of -»§-reducts of a in A) is finite. 

Definition 4.21 (regular and strongly regular infinite A-terms). An infinite 
A-term M is called regular (strongly regular) if there exists a scope-delimiting strat- 
egy § for Reg (a scope + -delimiting strategy S + for Reg + ) such that M is S-regular (is 
S + -regular). 

Note that an infinite A-term M is regular (strongly regular) if and only if the set ST§(M) 
(the set ST§+(M)) of generated subterms of M with respect to some scope-delimiting 
strategy § (with respect to some scope + -delimiting strategy S + ) is finite. 

Proposition 4.22. The following statements hold: 

(i) Every strongly regular infinite A-term is also regular. 

(ii) Finite A-terms are both regular and strongly regular. 

Proof. For statement <JTJ) , let M be an infinite A-term, and let S be a scope-delimiting 



strategy for Reg + such that ST§(M) is finite. Due to Proposition 4.4 S can be modified 
with the result of a scope-delimiting strategy S' for Reg by exchanging -»-s-steps with 
-►del-steps. Then there is a stepwise correspondence between -^-rewrite sequences and 
->S'-rewrite sequences that pass through the same terms. Consequently, the sets of -»§- 
and ^s-reducts of ()M coincide: ST S <(M) = ST S (M). It follows that ST S ,(M) is finite. 



For statement (pj, note that by Proposition 4.8 |vi| , and Konig's Lemma, every finite 



term in Ter((A)) has only finitely many reducts with respect to 

-^reg) Or — ^rcg + • It follows 

that for every finite A-term M and every scope-delimiting strategy S on Reg, or on Reg + , 
the number of -»g-reducts of ()M is finite, too. □ 

The scope-delimiting strategies in the above definition could be fixed to the respective 
eager versions without changing the notions of regular and strongly regular infinite A-term. 

Proposition 4.23. For all infinite A-terms M the following statements hold: 

(i) M is regular if and only if M is S cag -regular. 

(ii) M is strongly regular if and only if M is S,t ag -regular. 

Proof. We only prove (gj, because (JTiJ) can be established analogously. The implication 
"<=" in ^ follows directly from the definition of regularity. For showing "=>■" , let M be an 
infinite A-term that is regular. Then there exists a scope-delimiting strategy S so that 



STg(M) is finite. Since by Proposition 4.18 dil), every -»s oag -rewrite sequence factors 



into an (-»§ • -»dei)-rewrite sequence, it follows that every term in ST§ B ^ g (M) is the 
-»dcrreduct of a term in ST$(M). As every term in Ter((A°°)) has only finitely many 
^»dcrreducts, it follows that also STs eag (M) is finite. □ 



Clemens Grabmayer and Jan Rochel 



24 



Example 4.24 (regular and strongly regular terms). The following examples 
demonstrate the connection between (strong) regularity and, as illustrated in Figure |8j 
the finiteness of the ARSs induced by Reg (Reg + ) strategies. 

— Example 1 1.1 1 is regular but not strongly regular. 

— Example |3.18| is strongly regular. 



For further illustration of the statements made in Example 4.24 let us consider various 
Reg and Reg + rewrite sequences corresponding to infinite paths through the terms. 

Example 4.25. For the term M from Example however we first introduce a finite 
CRS-based notation, as a 'higher-order recursive program scheme'. We can represent 
M by \a.recM{o) together with the CRS-rule recM(X) -^ re( . M Xx.reCM(x) X . It holds 
that Xa.recM(o-) -"*rec M M. Using this notation we can finitely describe the infinite path 
down the spine of Example by the cyclic S eag rewrite sequence: 

() Xa.rec M (a) 
~ *a (Xa)rec M (a) 
-> reCM (Xa)Xb.rec M (b)a 
-*X (Xab)rec M (b) a 
-^@ (Xab)rec M (b) 
-►del (Xb)rec M (b) 

(Xa)recM(a) 

In Reg + the rewriting sequence for the same path is invariant over all scope + -delimiting 
strategies and necessarily infinite: 

()Xa.rec M {a) (Xa)rec M (a) -*rec M 

(Xa)Xb.rec M (b) a —>-\ (Xab)rec M (b) a (Xab)rec M (b) -+rec M 

(Xab)Xc.recM(c)b (Xabc)recM(c) b -*-@ (Xabc)recM(c) ~^rec M 

(Xabc)Xd.recM(d) c ->\ (Xabcd)recm(d) c -^-@ (Xabcd)recM(d) -+rec M 

(Xabcd) Xe.recM(e) d ->\ (Xabcde)recM(e) d ... 

Example 4.26. As an illustration of Example |3.18| we study a slightly modified version 
M of the term, defined as the unfolding of letrec / = Xxy.fyx in /. It is strongly regular 
since the infinite path through the term can be witnessed by this cyclic Reg + rewriting 
sequence: 



()M 

QXxy.Myx —>\ 

(Xx)Xy.Myx — >• a 

(Xxy)Myx -^@ 

(Xxy)My -^@ 

(Xxy)M -+ s 

(Xx)M -+ s 
QM 

See also Figure [9] for a graphical illustration of the induced sub-ARS. 
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^cag S lazy 

Fig. 9. Two Reg* sub-ARSs induced by the infinite unfolding of 
letrec/ = Xxy.fyx in /. 

Remark 4.27. The restriction of Proposition |4.23| to the eager scope-delimiting strategy 
cannot be relaxed to arbitrary scope-delimiting strategies. The term in Example |4.26| for 
instance is Sg ag -regular but not Sj^ zy -regular (Figure |9j). 

Definition 4.28 (grounded cycles in Reg, Reg + ). Let r : (Xxq)Mo (Xxi)Mi -*■... 
be a finite or infinite rewrite sequence with respect to -*rcg or ->- Tes + ■ By a grounded cycle 
in r we mean a cycle (Xx i )M i -> (Xx i+ i)M i+ i (Xx i+ k)M i+k = (Xxi)M t , in r, 

where ieN and k > 1, with the additional property that > for all j € {0,1, ... , k) 
(i.e. the lengths of the abstraction prefixes in the terms of the cycle is greater or equal to 
the length of the abstraction prefix at the first and final term of the cycle). 

Proposition 4.29. Let M be an infinite A-term that is S-regular (S + -regular) for some 
scope-delimiting strategy S (scope + -delimiting strategy § + ). Then every infinite rewrite 
sequence with respect to S (with respect to S + ) contains a grounded cycle. 

Proof. Since the argument is analogous in both cases, we only treat strongly regular 
terms. Let M be an infinite A-term that is S + -regular for some scope + -delimiting strat- 
egy S + , and let r : M = (Xx )M -»§+ (Axi)Mi ... be an infinite rewrite sequence. 
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As M is § + -regular, the sequence {{Xx^Mi}^ of terms on r contains only finitely 
many different terms. Let I := liminf{|xi|}i E N, that is, the minimum of abstraction prefix 
lengths that appears infinitely often on r. Let {(Xxi j )Mi j be the subsequence of 
{(Xii) -MjjjgN consisting of terms with prefix length I, and such that, for all k > io, \xk\ > I- 
Since also this subsequence contains only finitely many terms, there exist ji, ji € N, ji < 72 
such that (Xx iji )M iji = (Xxi j2 )M ij2 . By the choice of the subsequence it follows that 
(Xx ln )M ln . . . (Xx lj2 )M l]2 is a grounded cycle in t. □ 

Rounding off this section, we describe a motivation for the system Reg + in terms of a 
operation 'parse' that at the same time (i) stepwisely decomposes infinite A-terms into 
its generated subterms, and (ii) stepwisely recombines the decomposition analysis of 
generated subterms encountered with the original term as the result in the limit. With 
this purpose in mind, we define, below, a CRS Parse* . 

Definition 4.30 (Parse*). Let S(A). pa rse+ = ^(a) u {P arse ^ | n e N} be the extension of 
the CRS-signature for (A), where for n e N, the symbols parse^ have arity n. By Parse* 
we denote the CRS with the following rules: 

(0parse+) : parse;(X„, pre n ([x n ] app(Z (x n ), Z^Xn)))) 

a pp( parsed (X n , pre n ([x n ] Z (x n ))), parse^X^ pre n ([x n ] Zi(£„)))) 

(PparseO: parse* (X n , pre„ ( [x n ] abs( [y] Z(x n , y) )) ) -* 

abs([y] parse+ +1 (X„, y, pre n+1 ([x„] [y] Z(x n , y)))) 

(fi'parse0 : parse^ (^i , • . • , X n+1 , pre„ +1 ( [x n+1 ] Z(x n )) ) 

parse+(Jfi, . . . ,X n , pre n ([x n ] Z(x n ))) 

(£parse+) : P arSe « ( X l , ■ ■ ■ . X n , P^n ( [%n] X n ) ) X n 

We denote by -^ parS c+ the rewrite relation induced by this CRS. 

Remark 4.31. Observe that the rules (g® 1 ) for i e {0, 1}, {g x ), and {g s ) of the CRS Reg* 
are contained within the rules (gf 3rse +), (Qp arS e+)> (£parse+)j respectively, of the CRS Parse* , 
in the sense that applications of the latter rules include applications of the former. 

This has as a consequence that repeated -^ parso + -steps on a term ()M lead to terms that 
contain generated subterms of M as closed subexpressions. Furthermore -^ parso + -rewrite 
sequences on ()M are possible that move redexes simultaneously deeper and deeper, 
analysing ever larger parts of M, and at the same time recreating a larger and larger 
A-term parts (stable prefix contexts) of M, the original term. 

Proposition 4.32. For every term M e Ter(A) it holds: 

(i) Parse* analyses M into its generated subterms: If parse^O M) -» parsc + M', then all 
subexpressions starting with pre„ (for some n € N) in M' are generated subterms of M. 
Moreover, for every generated subterm (Xy)N of M, there exists a -» parsc + -reduction 
M" of M such that (Xy)N is contained in M". 

(ii) Parse* reconstructs M: parseg(()M) -*» parS c+ M. 

Example 4.33. Let M be the infinite unfolding of letrec / = Xxy.fyx in / for which we 
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use as a finite representation the equation M = Xxy.Myx. In Parse* , M is decomposed, 
and composed again, by the infinite rewrite sequence (see also Example 3.12): 

parseS(QM) 
= parseg(() Xx.Xy.My x) 
-►parse+.A Ax' . parsej (x', (Xx')Xy.Myx') 



(Xx'y')My' x') 

(Xx'y')My') parsers', y', (Ax'y')x') 
(Xx'y')My') parsej"(x', (Xx')x') 
(Xx'y') M y ) x 

(Xx'y')M) P arse*(x',y', {Xx'y')y')x' 



,y', (Xx'y')M) y' x' 



^ P arso+ .\Xx' .Xy' . pa rsej (x 
-* P arse+ . @ Ax' . Ay' . pa rse-J (a; 
-»■ P arsc+ . s Ax' . Ay ' . pa rse^ ( x 
->-par S e+ .0 Ax' . Ay' . pa rse^ (x 
-* P arse+ . @ Ax' . Ay' . pa rsej (a; 
^ P arsc+ .o Ax' . Ay' . pa rsej (x 
^ P arso+.sAx'.Ay'.parse|(a;', (Ax')A/) y x 
^ P arsc+ .s Ax' . Ay' . parsed (QM)y'x' 

= Ax' . Ay' . pa rseg (() Xx.Xy.M y x) y' x' 

""Sparse"*". A • ■ • 

Note that the generated subterms of M appear as the last arguments of the functions 
parse^ in this rewrite sequence. 



5. Observing A| e t r ec-terms by their generated subterms 

In this section we adapt the concepts developed so far for the infinitary A-calculus to 
A| etr ec. By combining the rules of ii v with those of Reg and Reg*, respectively, we obtain 
the CRSs i?e<7| etrec and i2eg^ trec for the deconstruction of A| e trec-terms furnished with 
an abstraction prefix. We define scope-delimiting and scope + -delimiting strategies for 
Reg\ etrec and Regl etrec as before by excluding all non-determinism except for sources of 
-*-@ - and -►Oj-steps. 

Definition 5.1 (The CRSs Reg and Reg* for decomposing A| etrec -terms). We 



extend S> letrec (see Definition 3.3) by function symbols pre„ with arity one to obtain the 
signature £(Ai etrec ) = SA| etrec u {P re « | n e N} on which we define the Aietrec-calculus with 
abstraction prefix. We denote the induced set of (finite) terms by Ter((X\ etrec )) and we 



adopt the same informal notation for (A| etrec ) as for (A°°) (see Definition 4.1). 

On the signature SA| etrec we define the CRS Reg^ trec (the CRS Regf etrec ) with the rules 
as the union of the rules of R v and Reg (the union of the rules of R v and Reg*). 

Definition 5.2 (The ARSs Reg letrec , Regl etrec ). We denote by Reg [etrec and i?ey^ trec the 
abstract rewriting systems (ARSs) induced by the CRSs i?e£/| etrec and Reg^ etrec , respectively. 
By Reg letrec , and i?ey^ trec we denote the ARSs that result by restricting the ARSs Reg letrec , 
and Regf etrec , respectively, to the subset Ter( A| etrec ) of terms. 



Like in Definition 4.7 the symbols used for the rewrite relations of Reg letrec and Reg] 



letrec 
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are overloaded; we use the same symbols to denote the restrictions of said rewrite relations 
to Ter((A| etre c))- 

The lemma below formulates a number of simple rewrite properties for i?e<?| etrec and 
Regl etrec , mainly concerning the interplay between unfolding and decomposition steps. 

Lemma 5.3. On Ter((\\ etrec )), the rewrite relations in Reg letrec and Regf etrec have the 
following properties: 

(i) -»- v one-step commutes with each of -»^, -»■©„, ->■©!, and ^dci : 

(ii) ^-rcg m R e 9\ e trec and ^rcg+ m R e 9\etrec have the same normal forms as -^rcg m Reg and 
- > re g+ in Reg + , respectively: (Aa;)x is the single term of (A| etrec ) in -»- reg -normal form. 
Every -> reg +-normal form a term in (A| etrec ) is of the form (Axi . . . x n )x n . 

Proof. The commutation properties in ^ are easy to verify by analysing the behaviour 
of the rewrite rules in Reg\ etrec and in Regl etrec on the terms of Ter((A| etrec )). 

The statement in (JTTJ) follows from Proposition 4.8 and (|vi|): Normal forms with 



respect to -^ rog in Reg letrec and -> reg + in Reg ietrec can only be A-terms without occurrences 
of letrec, since every occurrence of letrec in a A| e trec-term gives rise to a -^ v -redex. □ 

Lemma 5.4. The rewrite relation -*\^ (to infinite -^ v -normal form in at most u steps) 
one-step commutes with -*-,\, -^<a , ->@ I5 -+s, and -^dci : 

^ v -^aE^a-^ v ^ v • -»•©< £ -»•©< • ^ v («e{0,l}J 

y ' — "'del ' y ' del — — del ' y 

This implies, for prefixed terms that have unfoldings that: 

U((Xx)Xy.L )^ x U((Xxy)L Q ) W((A2)L ii) -o« W((A2)L<) (»6{0,1}) 

(Azi . . .x„+i)M ->s (Axi . ..x n )M => U((\xi . ..x n+ i)M) -» s W((Axi . . .x„)M) 
(A5)M^ de i (Ax')M => U{{Xx)M) ^ dcX U{{Xx)M) 

Furthermore it holds: ^ ■ -> v £ «J^. 

Proof. The commutation properties with the rewrite relation ^- ! " can be shown by 



using refined versions of the commutation properties in Lemma 5.3, ([ij), in which the 
minimal depth of unfolding steps is taken account of. When denoting by -►zj 1 the rewrite 
relation that is generated by -^ v -steps of depth > n, then the following properties hold: 

>n >n >n+l >n 

' ^ A - A ' ^ v ' - ' 

>n+l r- >n >n+l r- >n 

^ v • £ ' ^ v ' ^dcl E ""Mel ' ^ v 

Using these properties, strongly convergent -» v -rewrite sequences can be shown to project, 
over ^dci-j an d -^s-steps, to again strongly convergent -»- v -rewrite sequences. 

The property ■ ->- v E can be shown by using refined versions of the elementary 
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diagrams from the confluence proof that take the the minimal depths of steps into account. 

□ 

As for Reg (Reg + ) a we require of scope-delimiting strategies to have deterministic -*-dei- 
steps (^s-steps). As was the case for scope-delimiting and scope + -delimiting strategies 
for Reg and Reg + , these strategies will also here fix all non-determinism except for the 
choice between @ an d @i- 

Definition 5.5 (scope/scope + -delimiting strategy for A| e trec-terms). A strategy § 
for i?e<?| etrec (for Regl etrec ) will be called a scope- delimiting (scope + -delimiting) strategy if: 

— S is deterministic for sources of -^-steps, -> de i-steps (->s-steps), and all letrec-unfolding 
steps (i.e. all -^ r -steps for every rule g r v of i? v ). 

— S enforces eager application of g r ° d : every source of a step in S according to a rule 
different from g™ d is not the source of a -^- ro d-step. 

We say that such a strategy S is a lazy-unfolding scope-delimiting strategy (a lazy- 
unfolding scope + -delimiting strategy) if furthermore: 

— § applies the rules of i? v . rc d only at the root of the term, i.e. directly beneath the 
abstraction prefix except for p™ d . 

— § uses the rules of R v other than reduction rule p™ d in a lazy way: for every source s 
of a step in S with respect to a rule of J2 V other than g™ d it holds that s is not also 
the source of a step, in the ARS underlying the strategy, with respect to one of the 
rules of Reg (of Reg + ) 

For every scope-delimiting strategy S on Reg ietre! . (on Reg^ etrec ), we denote by -»-<§ 0j s, 
->@i,S, -*a,s j -*S,S; -^dci,s , and by -*- v .@.s, -* v .a.s , ^v-nii.s, ^v-roc.s, ^v-ictrcc.s, and 
^v-rcd.s the rewrite relations that are induced by those steps according to S that result 
by applications of the rules g®° , g® 1 , g x , g s 7 g dcl of Reg and Reg* , and the rules g® , g x , 
Qy nil , £>v c . (?v trcc , and g™ d of # v , respectively. 



Remark 5.6. We need to permit the application of g™ d anywhere inside the term 
to handle terms that contain inaccessible bindings in binding groups. Otherwise the 
possibility to apply g del (g s ) may be blocked in the case of a prefixed A| etre c-term (Xx)L 
in which a variable from the abstraction prefix is vacuous with respect to the infinite 
unfolding of (Xx)L but is bound by a term in an inaccessible binding of some binding 
group in L. 

We can however restrict applicability of g T ° d to outermost redexes if we start unfolding 
on reduced terms, that is, ^ v . rc d-normal forms, because then p™ d -redexes can only arise 
at outermost positions (by application of the p®-rule). 



Remark 5.7 (non-deterministic unfolding). Note that in Definition 5.5 we do not 
only require of a strategy to eliminate the non-determinism with respect to g del -transitions 
(g s -transition) but all non-determinism except for g®° / g® 1 non-determinism. This restric- 
tion will play a role later for the definition of A-transition graphs in Section [9j and here 
below for the definition of the projection of scope-delimiting (scope + -delimiting) strategies 
on Aietrec-terms to scope-delimiting (scope + -delimiting) strategies on infinite A-terms. 
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Remark 5.8 (eager application of Q r y d )- By requiring scope/scope + delimiting strate- 
gies to apply g T ° d eagerly we can exploit a useful property with respect to free variables of 
a term: if (Xx)L e Ter(A 

letrec) is in — >y rcc [-normal form, then the free variables occurring 
in M correspond to the free variables oiU(L). 

Note that here we have applied W to a prefixed term which requires an extension of U 
to terms in Ter((A| etrec )). 

Definition 5.9 (unfolding of prefixed terms). We redefine U as a partial function 
over the domain Ter((A| etrec )): 

U : Ter{ ( A letrec ) ) ^Ter{\°°) 

L h- M if L M 



Uniqueness of U follows from Lemma 3.15 



Definition 5.10 (productive terms w.r.t. scope-/scope + -del. strat.). Let I be a 

Aietrec-term, and S a scope-delimiting strategy for i?e<?| etrec , or a scope + -delimiting strategy 
for Regl etrec . We say that L is ^-productive if every infinite rewrite sequence on L with 
respect to § contains infinitely many steps according to -^-@ Q .s , -^@ u s , or -*■>,,§ ■ 

Definition 5.11 (generated subterms of A| etrec -terms). Let S be a scope-delimiting 
strategy for Reg letrec (for Regl etrec ). For every L e Ter(A| etrec ), the set ST§(L) (the set 
STg(L)) of generated subterms of L with respect to Reg letrec and S (with respect to 
Regl etrec and §) is defined as the set of -*-g-many-step reducts (the set of -^g-many-step 
reducts) of ()L via the mappings: 

ST§ : Ter(A| etre c) -» P(Ter((A| etr ec))) ST§ + : Ter(\\ etrec ) P(Ter((\\ etrec ))) 

L » ST S (L) := (()L-» S ) L^STi + (L) := (()L^ S+ ) 

The following lemma states that every scope + -delimiting strategy for Regl etrec (on 
A| etrec -terms), when restricted to the reducts of a A| etre c-term L that expresses an infinite 
A-term M, projects to the restriction of a scope + -delimiting strategy for Reg + (on infinite 
A-terms) to reducts of M. And it asserts a similar statement for scope-delimiting strategies. 
The projection hereby makes use of the commutation properties described in Lemma |5.4| 
between the infinite unfolding and the decomposition rewrite relations 



►A- 



s, and ^dci- 



Lemma 5.12. Let § be a scope + -delimiting strategy § for Reg ]etrec (for Regl etrec ), and 
let L be a A| e trec-term that is S-productive. Then there exists a (history-aware) scope- 
delimiting strategy (scope + -delimiting strategy) S for Reg (for Reg + ) such that the induced 
sub-ARS (U(L)^g) of U{L) is the projection (under the unfolding mapping U) of the 
induced sub-ARS (£-»§) of L, in the sense that for all L' in (L-»g) it holds: 

L' -» v ,s ' ^A/@,/s/dci,s L" ==>• U(L') ^§U(L ) 
U(L') - s M" =^ (3L") [L' -» VlS • -A/^/s/dci.s L" a M" = U(L") ] 
As a consequence, U{L) is S-regular if L is S-regular. 
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Proof sketch We can utilise Lemma 5.4 to make commuting diagrams out of the two 
formulas above (for any given L'), which allows us to determine S with respect to all 
terms in (L-»§). This freedom in the definition of § also guarantees the property in the 
second implication in the lemma. □ 

Definition 5.13 (Parse^ + ). By Parse v + we denote the CRS comprising the rules of 
Parse* as well as the rules from R v . 

Example 5.14. When applied to letrec / = Xxy.fyx in / is Parse v + unfolds and de- 
composes, but at the same time recreates the corresponding infinite A-term (see also 
Example 3.12 and Example 4.33): 

parseo(()letrec/= Xxy.fyx'mf) 
-►v.rec parseo(()letrec/= Xxy.fy x in Xxy.fyx) 
-»V.A parse,5(() Ax. letrec / = Xxy.fyx in Xy.fyx) 
^parsc+.A Ax. parse j(x, (Ax) letrec / = Xxy.fyx in Xy.fyx) 
-*- v .A Ax.parse|(x, (Ax) Ay. letrec / = Xxy.fyx in f yx) 
^parsc+.A Axy. parse^ (x, y, (Xxy) letrec / = Xxy.fyx'm fyx) 

^v.@ Axy. parsed (x,y, (Xxy) (letrec / = Xxy.fyx in fy) (letrec / = Xxy.fyx'm x)) 
^v.rcd Axy.parse2(x,y, (Axy) (letrec / = Axy./ yx in /y) (letrec inx)) 
-►vnU Axy. parsed (x,y, (Xxy) (letrec / = Xxy.fyx'mfy)x) 

,(x,y, (Axy) letrec / = Xxy.f yx'm f y)) (parse^y, (Axy)x)) 
,(x,y, (Xxy) letrec f = Xxy.fyx'm fy)) (parset(x, (Ax)x)) 
,(x,y, (Axy) letrec / = Xxy.f yx'm f y)) x 

,(x,y, (Axy) (letrec / = Xxy.fyx'm f) (letrec / = Xxy.fyx in y)))x 
,(x,y, (Axy) (letrec / = Xxy.fyx'm f) (letrec iny)))x 
,(x, y, (Axy) (letrec / = Xxy.fy x in f)y))x 
>(x,y, (Axy) letrec / = Xxy.fyx'm /)) (parse2(x,y, (Xxy)y))x 
,(x,y, (Axy) letrec / = Xxy.fyx'm f))yx 
'(x, (Ax)letrec / = Xxy.fyx in /)) yx 
,(()letrec/= Xxy.fy x in /)) yx 



Sparse 4- .@ Axy. (parsed 

parse 1 " .b 

Axy. (parsed 
"Sparse 4- .0 Axy. (parsed 
-^ v .@ Axy. (parsed 
^v.rcd Axy. (parsed 
-"v.nil Axy. (parsed 

Sparse*.® 

Axy. (parsed 

"Sparse 4- .0 

Axy. (parsed 

parse" 1 " .b 

Axy. ( parse j( 
■* P arse+.S Axy. (parsed 



-►yrec Axy. ... yx 

Lemma 5.15. For all closed L e Ter(A| etr ec) the following statements are equivalent: 
(i) L expresses an infinite A-term M, that is, L M . 



V- parse" 



+ M, for some infinite A-term M. 



(ii) parsed (()L) 

(iii) L is S + -productive for some scope + -delimiting strategy S + 

(iv) L is S + -productive for every scope + -delimiting strategy S + . 

Proof. Let L e Ter(X\ etrec ). We show the lemma by establishing the implications in the 
following order: "(|rv|) => ([Hi]) => (jl!)) => @ => (|rv|)" . 
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The implication " (fivj) =>• (jmj) " is clear: (frv|) implies that L is productive for e.g. the 
lazy-unfolding, eager scope + -delimiting strategy on Regl etrec . 

For showing the implication " (Imj) =>■ (Jiij)", let S + be a scope + -delimiting strategy for 
Reg + such that L is S + -productive. Then the strategy S + defines a ^• v . parsc +-rewrite 
sequence r on parseo(O-L) by using § + to define next steps on subexpressions that are 
of the form parse+(. . ., pre n ([xi, . . . ,x n ]P)) in already obtained reducts: if on a term 
(Xxi, . . . ,x n )P the strategy S + prescribes a -^ w -step, then this step is taken over in r; if 
§ + prescribes a -^A,s+-step, then r can continue with a ^-parso+.A-step; if S + prescribes a 
_! "@o,s + ~ an d a ^@i,S + -step, then r can continue with a -^ par sc+,<a-step. For the construction 
of t, possible steps in subexpressions parse+(. . ., pre n ([xi, . . . ,x n ]P)) at parallel positions 
have to be interleaved to ensure that the reduction work is done in an outermost-fair 
way. Productivity of S + on L ensures that always after finitely many steps inside a 
subexpression parse+(. . ., pre ra ([a;i, . . . ,x n ]P)) the function symbol parse^ disappears at 
this position (either entirely, or it is moved deeper over a A-abstraction or an application). 
In the terms of the rewrite sequence r larger and larger A-term contexts appear at the 
head. Hence r is strongly convergent, and it obtains, in the limit, an infinite A-term; thus 
it witnesses r : parse,5(()L) -^. parse+ M. 

For the implication "(jiij) =>• (P" , suppose that r is a rewrite sequence that witnesses 
parse^QL) -^y. parso + M for some infinite A-term M. Since the -^ parso + -steps require 
already unfolded parts of the term, they have to 'shadow' unfolding steps. All -> v -steps in 
t take place beneath symbols parse^. So the possibility of -> parse + -steps during r depends 
on the unfolding steps during r, but not vice versa. Hence a rewrite sequence r' on 
parseQ(()L) can be constructed that only keeps the -> v -steps from r. Since r is strongly 
convergent and converges to M, r' witnesses parseQ(()L) ^ parseQ(()M). By dropping 
the 'non-participant' prefix context parseQ(()rj) from all terms in r', and adapting the 
steps accordingly, a rewrite sequence r" is obtained that witnesses r" ; L ^ M. 

We show the implication "(JT|) (jrvj) " indirectly. So we assume that there is a scope + - 
dclimiting strategy § + such that L is not S + -productive. As in the proof above of "dm]) => 
(jli])", S + defines an outermost-fair, maximal -> v . parso + -rewrite sequence r on parse^QL). 
But since S + here is a strategy that is not productive for L, it follows that, due to its 
construction, r does not succeed in 'pushing' all function symbols letrec to deeper and 
deeper depth, and thereby building up an infinite A-term. Instead, this outermost-fair 
-*- v . parse + -rewrite sequence contains infinitely many steps at the position of an outermost 
occurrence of letrec. Since, other than the -> v -steps, the ->- pa rsc+ -steps (which always take 
place above outermost occurrences of letrec-symbols) cannot be the reason for this, the 
same stagnation of an outermost-fair unfolding process takes place if the -> parse + -steps are 
postponed, that is dropped from r. In this way, by again dropping the 'non-participant' 
prefix context parse^Qrj) from the terms of r, and adapting the steps accordingly, we 
obtain an outermost- fair -> v -rewrite sequence starting on ()L that does not converge to 
an infinite A-term. But then Lemma |3. 15| implies that L does not unfold to an infinite 
A-term. □ 
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(Xxy)Mo (Ax) M (Ax) Mi 



(Xx)\y.M (Ax)M Mi 



(Axi . . .x n -i)M (if the binding 



(Axi ...x n )M Ax n is vacuous) 



Fig. 10. Proof system Tp^ for defining the set of infinite A-terms. 



(A5/i.../„)Mi ... (Ax/ .../n)M„ (A2/ .../„)M 

letrec 

(Ax) letrec /i = A/i, ...,/„ = M„ in M 



Fig. 11. Proof system T( A|etrec ) defined as an extension of by an additional rule 
to define the set of A| e trec-terms 



6. Proving regularity and strong regularity 

In this section we introduce proof systems that are sound and complete for the notions of 
regular, and strongly regular, infinite A-terms. In order to prove soundness and complete- 
ness, we establish, as auxiliary results, a correspondence between scope-delimiting/scope + - 
delimiting strategies for Reg/Reg + and closed derivations in the corresponding proof 
systems. Then we introduce a proof system that is sound and complete for equality 
between strongly regular infinite A-terms. Furthermore, we give two proof systems that 
are sound and complete for the property of A| e trec-terms to unfold to infinite A-terms. And 
finally, we show the following part of our characterisation result: infinite A-terms that are 
unfoldings of A| etrec -terms are strongly regular. 



We start with a more formal definition of A-terms and A| e trec-terms than in Definition 3.3 
by means of derivability in proof system that formalises term decomposition. 

Definition 6.1 (infinite A-terms). We define the set of prefixed infinite A-terms as those 



terms in Ter(E( A )) for which there exists a possibly infinite, completed (see Definition 2.3| 
derivation in the proof system Tpo with axioms and rules as shown in Figure 10 

Ter((A~)) == {M e Ter(E (A) ) | "Vt-, M} 

The set of plain infinite A-terms are those terms that comply with the previous definition 
when equipped with an empty prefix: 

Ter(A~) := {M e Ter(E A ) | "Vt^ ()M e Ter((A~))} 

Definition 6.2 (A| etre c-terms). The set of prefixed A| e t re c-terms comprises those terms 
out of Ter(£( A|etrec )) for which there exists a finite derivation in the proof system T(,\ |etrec ) 
(Figure [H]): 

Ter((A letrec )) := {M e Ter(S (A|arec) ) | h T(w) M) 
The set of plain A| e t re c"t erms are those terms that comply with the previous definition 
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(Xxy)y ° 


(Xxy)Mo ^ 
(AS) Ay. M 


(Xx)Mo (AS) Mi 
(Ax) M Mi 

[{Xx)MT 


(Aa;i ...x„-i)M 


(if the binding 




(Xxi . . . x n )M 


Xx n is vacuous) 


(Xx)M , , , , 
FIX,n (if X»o| > 1) 

{Xx) M 



Fig. 12. The natural-deduction style proof system Reg + for strongly regular infinite 
A-terms, which is an extension of by one additional rule (FIX). In the variant 
system RegQ, instances of (FIX) are subject to the following side-condition: for all 
(Ay) AT on threads in T>q from open marked assumptions ((Xx)M) u downwards it 
holds that \y\ > \x\. 



when equipped with an empty prefix: 

Ter(A, etrec ) := {M e Ter(S w ) | H T(JW , ()M e Ter((A letrec ))} 

Building on rules already used in the proof systems for term formation in A°° and 
(A°°) from the definition above, we now introduce proof systems for regularity and strong 
regularity of infinite A-terms in A°° . 

Definition 6.3 (proof systems Reg, and Reg + , Reg^). The natural-deduction style 
proof system Reg + for recognising strongly regular, infinite A-terms contains the axioms 



and rules as shown in Figure 12 In particular, the rule (FIX) is a natural-deduction 
style derivation rule in which marked assumptions from the top of the proof tree can 
be discharged. Instances of this rule carry the side-condition that the depth |X>o| of the 
immediate subderivation T>o of its premise is greater or equal to 1 (hence this subderivation 
contains at least one rule instance, and, importantly, for a topmost occurrence of (FIX), 
T>q must have a bottommost instance of one of the rules (A), (@), or (S)). 

The variant Rego of Reg contains the same axioms and rules as Reg + , but in it 
instances of (FIX) are subject to the additional side-condition: for all (Xy)N on threads 
in T>o from open marked assumptions ((Xx)M) u downwards it holds that |y| > 

The natural-deduction style proof system Reg for recognising regular infinite A-terms 
differs from Reg + by the absence of the rule (S), and the presence instead of the rule 



(Vacuous) in Figure 13 and by the restriction of the axiom scheme (0) to the more 
restricted version displayed in Figure |13| 

Provability of a term in (A°°) in one of these proof systems is defined as the existence of 
a closed derivation: for Re {Reg, Reg + , Reg^} we denote by \-r (Xx)M the existence 
of a prooftree T> with conclusion M and with rule instances of R such that all marked 
assumptions at the top of the T> are discharged at some instance of the rule (FIX). 



Remark 6.4. The proof system Reg + is related to a proof system for nameless, finite 
terms in the A-calculus that is used in (van Oostrom et al., 2004 sec. 2) as part of a 
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(\Xi . ..Xi-iXi+i . . . X n )M 



Vacuous 



(if the binding 
Xxi is vacuous) 



(Azi ...x n )M 



Fig. 13. The natural-deduction style proof system Reg for regular infinite A-terms 
arises from the proof system Reg + by replacing the rule (S) with the rule 
(Vacuous) for the introduction of vacuous bindings in the A-abstraction prefixes, 
and by replacing the axiom scheme (0) of Reg + by the more restricted version here. 



translation of A-terms into 'Lambdascope' interaction nets, which are used for optimal 
evaluation (in the sense of Levy) of A-terms. 

The proposition below explains that the side-condition on instances of (FIX) from 
the proof systems above to have immediate subderivations T> with |2? | > 1 entails a 
'guardedness' property for threads from such instances upwards to discharged instances. 

Proposition 6.5. Let V be a derivation in Reg, in Reg + or in RegQ possibly with 
open marked assumptions. Then for all instances l of the rule (FIX) in T> it holds: every 
thread from i upwards to a marked assumption that is discharged at t passes at least one 
instance of a rule (A) or (&). 

Proof. Since for Reg + and RegQ the argument is analogous, we only consider derivations 
in Reg. So, let V be a derivation in Reg. Furthermore, let t be an instance of the rule 
(FIX) in T> with conclusion (Xx)M, and let tt be a thread from the conclusion of l upwards 
to a marked assumption {{Xx)M) u . Let k be the topmost instance of (FIX) in T> that is 
passed on tt. By its side-condition, the immediate subderivation of k has depth greater 
or equal to 1, and hence there is at least one instance of a rule (A), (@), or (Vacuous) 
passed on tt above k. If there is an instance of (A) or (@) on this part of tt, we are done. 
Otherwise only rules (Vacuous) are passed on tt above k. But since the rule (Vacuous) 
decreases the length of the abstraction prefix in the term occurrences in a pass from the 
conclusion to the premise, and since the length of the abstraction prefix at the start of tt 
is the same at the end of tt, namely it follows that at least one occurrence of a rule 
that increases the length of the abstraction prefix must have been passed on tt, too, on the 
part from i to k. Since the only rule of Reg that increases the length of an abstraction 
prefix in a pass from conclusion to a premise is the rule (A), we have also in this case 
found a desired rule instance on tt. □ 



Example 6.6. Let M be the infinite unfolding of letrec / = Xxy.fyx in / for which we 
use as a finite representation the equation M = Xxy.Myx This term admits the following 
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derivations in Reg + , where as opposed to the former the latter has some redundacy: 

,,s , ((Xx)Xy.Mxy) u 

{{)M) U — — ^— X 

— — — S M 

(Xx)M ° (Xx)x ,;\ nr S 7T , 

— @ (Xx)M (Xx)x 



(Xx)M 



x 



5 ~ — — (Xx)M 



x 



(Xxy)Mx (^xy)y — — — — — S 



(Xxy) M x y 



(Xxy)Mx {^xy)y 



A (Xxy)Mxy 

(Xx)Xy.Mxy — — A 

77 A (Xx)Xy.M x y 

( M -) — '-^ — FIX, u 

FIX, u (Xx)Xy.Mxy 

()M ()M A 

Note that the derivation on the left is also a derivation in RegQ, but that this is not the 
case for the derivation on the right. There, for the occurrence of (FIX) the side-condition 
in the system RegQ is violated: on the path from the marked assumption ((Ax) Xy.M xy) u 
down to the instance of (FIX) there is the occurrence ()M of a term with shorter prefix 
than the term in the assumption and in the conclusion. See Example |8.5| for a comparison 
of annotated versions of the two derivations above. 



See Example 4.26 for a rewriting sequence in Rey corresponding to the leftmost path 



in both derivations, and also Figure [9] for the corresponding transition graph. 



Remark 6.7 (Reg + versus RegJ). While it will be established in Proposition 6.12 



below that provability in Reg + and RegQ coincides, the difference between these systems 
will come to the fore in annotated versions that are purpose-built for the extraction of 
Aietrec-terms that express infinite A-terms. This will be explained and illustrated later in 
Example |8.5[ using annotated versions of the two derivations in Example |6.6| above. 



Example 6.8. The infinite A-term from Example [T] with the A-transition graph shown 
in Figure [8] is derivable in Reg by the following closed derivation using the notation from 



Example [4T25| 

(Aa)recM(a) 



{{Xb)rec M {b)) u (Ao)a 

Vacuous —77 — 77; — Vacuous 



(Xab)recM(b) (Xab)a 



(Xab)recM(b) a 
(Xa)Xb.recM(b) a 
(Xa)recM(a) 
()Xa.recM(a) 



X 

FIX, u 



When trying to construct a derivation for this term in Reg + from the bottom upwards, 
the rules of Reg + apart from (in first instance) (FIX) offer only deterministic choices, 
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with as outcome an infinite prooftree of the form: 

(Xabcde)recM(e) (Xabc)c 

A 7T : it S 



(Xabcd) Xe.reCM(e) ' (Xabcd)c 

(Xabcd) recM (d) c (Xab)b 
(Xabc)Xd.recM(d) c (Xabc)b 



(Xabc)recM(c) b (Xa)a 

X 7T 77 S 



( Xab) Ac. recM (cjb (Xab)( 



(Xab)recM(b) a 
(Xa) Xb.recM(b) a 



X 



Q\a.rec]Si(a) 

But then, since this prooftree does not contain repetitions, also use of the rule (FIX) in 
order to discharge assumptions is impossible. Consequently, the term is not derivable in 
Reg + . 

For the Reg and Reg + rewriting sequences corresponding to the leftmost paths through 
the two proofs above, see Example |4.25| The corresponding transition graphs are displayed 
in Figure [8} 

Proposition 6.9. Let T> be a derivation in Reg (in Reg + ) with conclusion (Xx)M. 

Then every path in T> from the conclusion upwards corresponds to a ->- rcg -rewrite 
sequence (a -*- reg + -rewrite sequence) from (Xx)M: while passes over instances of (FIX) 
correspond to empty steps, passes over instances of the rule (@) to the left and to the 
right correspond to ->@ - and ^(^-steps, respectively; passes over instances of the rules 
(A) and (Vacuous) (the rule (S)) correspond to ->•>-, and -^-del-steps (->s-steps). The same 
holds for (finite or infinite) cyclic paths in T> that return, possibly repeatedly, from a 
marked assumption at the top down to the conclusion of the instance of (FIX) at which 
the respective assumption is discharged. 

Proof. The proposition is an easy consequence of the following facts: passes from a 
term in the conclusion of an instance i of one of the rules (A), (Vacuous), (S) to the term 
the premise of l correspond to ->-\-, -^dcr, and ->-s-steps, respectively; passes from a term 
in the conclusion of an instance of (@) to the left and the right premise correspond to 
->@ - and ^©j-steps, respectively. □ 



Observe that, for the derivation T> in Example 6.6 the -^• rcg + -rewrite sequences that 
correspond to paths in T> as described in Proposition |6.9| are actually rewrite sequences 
with respect to the eager scope + -delimiting strategy Sg ag for Reg + . This illustrates the 
general situation, formulated by the lemma below: paths in a derivation T> in Reg + (or 
in Reg) from the conclusion upwards correspond to rewrite sequences according to some, 
usually history-aware, scope + -delimiting (scope-delimiting) strategy S for Reg + (for Reg), 
which can be extracted from T>. 

Lemma 6.10 (from Reg/Reg + -derivations to scope/scope + -delim. strategies). 

Let M e Ter(A°°), and let T> be a closed derivation in Reg (in Reg + ) with conclusion 
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l:(Xxy)M lO:(Xx)M ll:{Xx)M x 



l: ( XS: y)y l:(Xx)Xy.M l:(Xx)M M 1 



[l:(Xx)M] u 



l:(Xx 1 ...x n - 1 )M (if Xn does not ~D 



l:(X Xl ...x n )M occur in M) l:(Xx)M ^ v ,.. m ,^ u 

————FlX,u (if Do >1 
I : (Xx)M 



Fig. 14. Proof system Reg + for decorating Reg + -derivations with labels in {0, 1} . 

()M. Then there exists an, in general history-aware, scope-delimiting strategy Sx> for Reg 
(scope + -delimiting strategy Sp for Reg + ) with the following properties: 

(i) Every (possibly cyclic) path in T> from the conclusion upwards corresponds to a rewrite 



sequence with respect to Sp starting on ()M in the sense of Proposition 6.9 where 
passes over instances of the rules (@) to the left and to the right correspond to -»® ,Sj>- 
and t s-n -steps, respectively, and passes over instances of (A) and of (Vacuous) (of 
(S)) correspond to ->a,Sx>-, and -Mel,Sh> -steps (^s,s D -steps), 
(ii) Every rewrite sequence that starts on ()M and proceeds according to §x> corresponds 
to a (possibly cyclic) path in T> starting at the conclusion in upwards direction: 
thereby a ->@ 0) g. D - and -^(g^s-p-step corresponds to a pass over (possibly successive 
(FlX)-instances, or from a marked assumption to the instance of (FIX) that binds it, 
followed by) an instance of (@) in direction left and right, respectively; a -*-\ t § v - or 
-*del,S2,-step (^s,s D -step) corresponds to a pass over (possibly (FlX)-instances and 
assumption bindings to (FlX)-instances) an instance of (A) or (Vacuous) (of (S)), 
respectively. 

(hi) 5T Sd (M) = {(Xy)N | the term (Xy)N occurs in V}. 

Proof. The proof defines a history-aware strategy E>t> for Reg + as a modification of an 
arbitrary (history- free) strategy for Reg + lifted to a labelled version of Reg + . Thereby 
the modification is performed according to a given derivation T>, and the construction 
will guarantee that (ID), {JTTJ) , and ( Jiii] ) hold. We establish the lemma only for the case of 
derivations in Reg + , since the case of derivations in Reg can be treated analogously. So, 
let T> be a derivation in Reg + with conclusion ()M. 

In a first step we decorate V with position labels such that a derivation 2?( lb ) with 

is obtained. Note that 



14 



conclusion e : ()M in the variant proof system Reg + in Figure 
the decoration process can be carried out in a bottom-up manner, where the label in 
the conclusion of a rule instance determines the label in the premise(s) if that is not an 
already labelled term, and where in the case of instances of the rule of (FIX) also the 
labels in marked assumptions are determined. 

In a second step we use the decorated version Z?( lb ) of V to define a history-aware 
strategy Sx> according to which the term ()M can be reduced as 'prescribed' by V^ h \ 
Since the derivations can only determine the strategy Sx> on terms occurring in T>. we also 
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have to define §t> on other terms. This will be done by choosing an arbitrary (but here: 
history-free) scope + -delimiting strategy S for Reg, and basing the definition of §x> on it. 

We start by defining a labelling of Reg + as the ARS for which Su will be defined as a 
history-free strategy, which together with an initial labelling [ then yields a history-aware 
strategy for Reg + . Assuming Reg + = (Ter((A°°)), src, tgt) as the formal representation 
of Reg + , we define the ARS Rey^ := f2tr((A°°)), $, src, tgt) where 

2V((A~)) := {I : (Ay) A \ (Ay) A e Ter((A~)), I e {0, 1}*} 
$ := {(/ : (Ay) A, <j>, I' : (Ay') A') | ^ holds} 

there is an instance of (A), (@), or (S) in Reg + with I: (Ay) A in] 

the conclusion and the term I' : (Ay') A' in the premise, and withj- (6-5) 

4> ■ (Ay) A -»- r eg+ (Ay') A' (one of) the corresponding step(s) in Reg + J 

and where src, tgt : $ -> Ter((A°°)) are defined as projections on the first, and respectively, 
the third component of the triples that constitute steps in <£>. Then the relation 

L :={((Xy)N, I : (Ay) A) | (Xy)N e Ter((A~)), I e {0, 1}*} 
u {(</>, ((Xy)N, 0, (Xy)N)) \ ((Xy)N, (f>, (Xy)N) e $} 

is a labelling of Reg + to Reg + . As initial labelling we choose the function [ that is defined 
by l:Ter((A~)) -* fe~ri(\°°)), (Xx)M ^ e : (Xx)M. and which adds the label V. 
Now we define the strategy Sx> = ( Ter((A°°)), $ on _-p(ib) u ^not-on-xK 111 ) ; src'! tgt ) with 



$on-2>0b) : = {{1 '■ (Ay) AT, 4>, l':(Xy')N'}€^\ Q holds} 

there is an instance of (A), (@), or (S) in 2? (lb) with /: (Ay)A^ in) 

the conclusion and the term I' : (Ay') A' in the premise, and with> (6-6) 

4> ■ (Ay) A ^ rog + (Xy')N' (one of) the corresponding step(s) in Reg + ) 

Kot-ovM ■■= {(I ■■ (Ay) A, 0, r : (Xy')N') e $ | ((63 holds} 



Z : (Xy)N does not occur in 2?( lb ), (/) is a step according to § } (6-7) 

where src', tgt' are the appropriate restrictions of src and tgt. 

Note that, by its definition, §x> is a sub- ARS of Reg + , Now for showing that Sx> is 
a (history-aware) strategy for Reg + , it has to be established that Sd is a history- free 
strategy for the lifted version Reg + of Reg + . For this it remains to show that every normal 
form of is also a normal form of Reg + . So, let I : (Ay) A e Ter((A°°)) be such that it 
is not a normal form of Reg + . Then also (Ay) A is not a normal form of Reg + . We will 
distinguish the cases that Z: (Xy)N occurs on I?( lb ) or not for showing that there is a 
step in Sd with this labelled term as a source. 

For the first case, we suppose that I : (Ay) A does not occur in 2?< lb ). Then there is a 
step 4> '■ ((Ay) A) -»■§ ((Ay') A') in the scope-delimiting strategy S in Reg + , which gives 
rise to the step (f>:(l: (Ay) A) ^ (I' : (Ay') A') in Se/ and in §x>. 

For the second case, we suppose that I : (Ay) A occurs in £>( lb \ and we fix an occurrence 
o. Since by assumption I : (Ay) A is not a normal form of Reg + , o cannot be the occurrence 
of an axiom (0), and hence it is either an occurrence as the conclusion of an instance of 
one of the rules (A), (@), (S) in 2>< lb \ or as a marked assumption in 2?( lb ). If o is the 
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conclusion of an instance t of (A), (@), or (S), then t defines a step on I : (Xy)N which 
also is a step in Sx>. If o is the conclusion of an instance of (FIX) in T>, then we consider 
an arbitrary path tt in 2?( lb ) from o upwards towards a leaf of T>^ h \ Since, due to the 
side-condition of the rule (FIX), immediate sub derivations of instances of (FIX) consist of 
at least one rule application, tt cannot consist merely of applications of (FIX). Hence by 
following 7r from o upwards, after a number of successive instances of (FIX), each of which 
have I : (Xy)N as conclusion and premise, an instance of one of the rules (A), (@), (S) 
follows, which witnesses a step with source (A), ((§>), (S) in in Reg + and in Sx>. Finally, if 
o is an occurrence in a marked assumption at the top of the prooftree 2?( lb ) , then, since 
2?( lb ) is a closed derivation and due to the form of instances of the assumption-discharging 
rule (FIX), there is also an occurrence o' of / : (Xy)N as the conclusion of an instance of 
(FIX) in 2?( lb ). Now the argument above can be applied to the occurrence d to obtain a 
step of Sd on I : (Xy)N. 

By construction Sx> conforms to (jij and Jh]) because of the inclusion of $ n-x>( lb > and 
$ not _ on _-p(ib) respectively; |iii| follows from □ 

Lemma 6.11 (from scope/scope + -delim. strategies to Reg/Reg + -derivations). 

Let M e Ter(X°°), and let S be a scope-delimiting strategy for Reg (a scope + -delimiting 
strategy for Reg + ) such that STs(M) is finite. Then there exists a closed derivation 
T> in Reg (in RegJ, and hence in Reg + ) with conclusion ()M such that the following 
properties hold (note the minor differences with the items (JTJ) , djjl, and ( |iii| in Lemma 6.10 1: 



(i) Every (non-cyclic) path in T> from the conclusion upwards to a leaf of the prooftree T> 
corresponds to a -^g-rewrite sequence starting on ()M where passes over instances 
of the rules (@) to the left and to the right correspond to -»-@ 0j s- and -^(a^g-steps, 
respectively, and passes over instances of (A) and of (Vacuous) (of (S)) correspond 
to -»-a,S-) and ^doi,s-steps (-»s,s-steps); passes from the conclusion to the premise of 
instances of (FIX) correspond to empty steps. 

(ii) Every sufficiently long ->-g-rewrite sequence on (Ai)M has an initial segment that 
corresponds to a (non-cyclic) path in V from the conclusion upwards to a leaf of 
the prooftree: thereby a ->@ 0i s- or -^©^g-step corresponds to a pass over (possibly 
some (FlX)-instances followed by) an instance of (@) in direction left and right, 
respectively; a -*-a,S - or ^doi,s-step (-^s.s-step) corresponds to a pass over (possibly 
some (FlX)-instances followed by) an instance of (A) or (Vacuous) (of (S)), respectively. 

(iii) ST S (M) c {(Xy)N \ the term (Xy)N occurs in V}. 

Proof. We will argue only for the part of the statement of the lemma concerning a 
scope + -delimiting strategy for Reg + , since the case with a scope-delimiting strategy for 
Reg can be established analogously. 

Let M be an infinite A-term, and let § be a scope + -delimiting strategy for Reg such 
that STg,(M) is finite. Now let T> be the (trivial) derivation with conclusion ()M, which, 
in case that this is not an axiom of RegQ (and Reg + ), is also an assumption, and then is 
of the form (()M) U , carrying an assumption marker u. If T> is an axiom, then it is easy 
to verify that the statements and |m] ) hold. 

Otherwise we construct a sequence T>i, T> 2 , ■ ■ ■ of derivations where each T> n satisfies 
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the properties (JTTJ) , and ( pTi| , terms in marked assumptions are not also terms in axioms 
0, and where T> n+ i extends T> n by one additional rule instance above a marked assumption 
in T> n : For the extension step on a derivation T> n , a marked assumption ((Xy)N) u in T> n 
is picked with the property that the term (Xy)N does not appear in the thread down to 
the conclusion of T> n . 

Suppose that the ^g-rewrite sequence from the conclusion of V n up to the marked 
assumption is of the form: 

r : QM = (Xxo)M Q -*g (AS x )Mi — (Xx m )M m = (Xy)N 

Note that, since by assumption (Xy)N is not a term in an axiom of Regg, it follows by 



Proposition 4.8 (JviJ) , that it is not a -> rcg + -normal form. Then depending on whether the 
possible next step(s) in an ->§-rewrite that extends r by one step is a -*-dei,§-step, 
or either a -^<a ,s- or a -►g^g-steps, the derivation T> n is extended above the marked 
assumption ((Xy)N) u by an application of A, S, or @, respectively. For example in the 
case that r extends by one additional step to either of the two rewrite sequences: 

n : QM = (Xxq)M ... -* s (Xx m )M m = (Xx m )M mfl M mA -*-© 4 ,s {Xx m )M m ^ 

with i 6 {0, 1}, the derivation T> n of the form: 



{((Xx rn )M mfl M mA ) u ) 

T> n is extended to "D n +i" 

()M 



((Ag m )M m , )"° ((A^ m )M m ,i) Ul 
((Xx m ) M my0 M mA ) 

QM 



for two fresh assumption markers uq and U\ (the angle brackets (...) are used here to 
indicate just a single formula occurrence at the top of the prooftree V n ). If either of 
(Xx m ) M m fi or (A5 m )M m> i is an axiom, then the assumption marker is removed and the 
formula is marked as an axiom 0, accordingly. Note that, if the statements (TilJ) , and 
( pi| are satisfied for V = D n , then this is also the case for T> = V n+ i. Furthermore, terms 
in marked assumptions are not terms in axioms of Regg . 

The extension process continues as long as T> n contains a marked assumption ((Xy)N) v 
without a 'Regg-admissible repetition' beneath it, by which we mean the occurrence o 
of the term (Xy)N on the thread down to the conclusion in 2?, but strictly beneath the 
marked assumption, such that furthermore all terms on the part of the thread down to o 
have an abstraction prefix of length greater or equal to |y|. (Note the connection to the 
side-condition on instances of the rule (FIX) in Regg, and, in particular, that marked 
assumptions with an Regg-admissible repetition beneath it could be discharged by an 
appropriately introduced instance of (FIX) in Regg.) 

That the extension process terminates can be seen as follows: Suppose that, to the 
contrary, it continues indefinitely. Then, since the derivation size increases strictly in every 
step, an infinite prooftree T>°° is obtained in the limit, which due to finite branchingness of 
the prooftree and Konig's Lemma possesses an infinite path 7r starting at the conclusion. 
Now note that due to ([1]), 7r corresponds to an infinite ->-g-rewrite sequence. Due to 
Proposition |4. 29} this infinite rewrite sequence must contain a grounded cycle. However, 
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the existence of such grounded cycle contradicts the termination condition of the extension 
process, because every grounded cycle provides an RegQ-admissible repetition. 

Let T> 7v, for some TV 6 N, be the derivation that is reached when no further extension 
step, as described, is possible. By the construction the statements (JTTJ) , and ( pii| arc 
satisfied for T> = T>n- Furthermore, T>n is a derivation in Reg + and RegQ in which every 
leaf at the top is either an axiom or an assumption ((Xy)N) u marked with a unique 
marker u, and for every such marked assumption in T>n, there is a RegQ-admissible 
repetition strictly beneath it. This fact enables us to modify T> n into a closed derivation in 
Regg by closing all open assumptions by newly introduced applications of the rule (FIX) . 
More precisely, steps of the following kind are carried out repeatedly. A derivation with 
occurrences of a number of marked assumptions ((Xy) N) Ui highlighted together with a 
single occurrence of the term (Xy)N in its interior that indicates the RegQ-admissible 
repetition for the displayed marked assumptions: 

«(W>...«(W> «MW>... ««)«)"> 

^ -^000 

((Xy)N) is modified into: — — - — — FIX, w 

.„ (Oy)TV) 

()M V ™ 
w ()Af 

where w is a fresh assumption marker. In every such transformation step the number 
of open assumptions is strictly decreased, but the properties Qip , QiiJ) , and ([TiT]) (for T> 
the resulting derivation of such a step) is preserved. Hence after finitely many such 
transformation steps a derivation T> in Regg without open assumptions and with the 
properties (JTJ) , (JTTJ> , and ( |iii| ) is reached, and obtained as the result of this construction. □ 

As a consequence of the two lemmas above, derivability in Reg + and in Reg£ coincides. 

Proposition 6.12. For all infinite A-terms M: i- Reg + ()M if and only if i-Regj ()M. 

Proof. The direction "<=" follows by the fact that every derivation in RegQ is also a 
derivation in Reg + . For the direction "=>", let M be an infinite term such that i- Reg + ()M. 



By Lemma 6.10 there exists a scope + -delimiting strategy S + such that ST§+(M) is finite. 



But then it follows by Lemma 6.11 that there is also a closed derivation in RegJ with 



conclusion ()M, and hence that H Reg + ()M. □ 

Now we have assembled all auxiliary statements that we use for proving a theorem 
that tightly links derivability in the proof system Reg with regularity, and derivability in 
Reg + and in RegQ with strong regularity, of infinite A-terms. 

Theorem 6.13. The following statements hold for the proof systems EQ™ and EQ°°. 

(i) Reg is sound and complete for regular infinite A-terms. That is, for all M e Ter(A°°): 

'"Reg ()M if an d only if M is regular. 

(ii) Reg + and Reg^ are sound and complete for strongly regular infinite A-terms. That 
is, for all M € Ter(A°°) the following assertions are equivalent: 
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P re n+1 ([Zl ■ • • X ^V\ V) = P re n+1 (I>1 • • • 2 ""] M ) 

pre„ ( [xi . . . x n ] s) = pre„ ([zi...z n ]t) (if y does not occur free in s, 

pre n+1 ([xi . . . x n y] s) = pre n+1 ([zi . . . z n w] t) and w does not occur free in t) 

P re n+i([^i ■ ■ ■ Xny] s) = pre n+1 ([zi . . . z n u]t) 
pre„([>i . . . x n ]abs([y] s)) = pre n ([zi . . . z„] abs([it] t)) 

pre n ([a:i ...x n ]s ) = pre n ([yi . . . y n ] to) pre n ([a;i . . . x„] si) = pre n ([yi . . . y n ] ti) 
pre„([ii...i„]s si) = pre n ([yi...2/„]t tx) 

Fig. 15. Proof system EQ^ for equality of preterms in (A°°) modulo = a . 

{\x)M = (Xz)N (if y does not occur in M, 

(^xy)y = (Xzu)u (\xy)M= (Xzw)N and w does not occur in N) 

(Xxy)M = (\zu)N (Xx) M = (Xy) N„ (Ax) Mi = (\y)Ni 

(Xx)Xy.M = (Xz)Xu.N A (Xx)M M 1 = (Ay)-/V iVi 

Fig. 16. Proof system EQ°° for equality of terms in (A°°) in informal notation. 



(a) M is strongly regular. 

(b) H Reg+ ()M. 

(c) H Reg+Q ()M. 

Proof. Since the proof of statement ^ of the theorem can be carried out analogously 
(taking into account Proposition 6.12[ ), we argue here only for statement 

in M, let M be an infinite A-term that is regular. Then there exists a scope- 



For 



delimiting strategy S on Reg such that STg(M) is finite. By Lemma 6.11 it follows 
that there exists a closed derivation T> in Reg with conclusion ()M. This derivation 
witnesses i- Reg ()M. For "<=" in (pi), suppose that i-R eg QM. Then there exists a closed 



derivation V in Reg with conclusion ()M. Now Lemma 6.10 entails the existence of a 
scope-delimiting strategy S in Reg + such that, in particular, STg(M) is finite. This fact 
implies that M is regular. □ 

For defining, shortly, of a proof system for equality of strongly regular infinite A-terms, 
we first give a specialised version of the proof system A°° for cv-equivalence of iCRS- 
prcterms for preterms, and a corresponding system on terms, in (A°°). 

Definition 6.14 (proof systems EQ™ , EQ°°). The proof system EQ™ for a-equi- 



valence of infinite preterms in (A°°) consists of the rules displayed in Figure 15 The 
proof system EQ°° for equality of infinite terms in (A°°) consists of the rules displayed in 
Figure 16 Provability in EQ™ and in EQ°° is defined, analogously to the proof system 
A°° from Definition 2.3 as the existence of a completed (possibly infinite) derivation, and 
will, as was first done so for A°° in Definition |2.2| be indicated using the symbol °°K 
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Proposition 6.15. The following statements hold for the proof systems EQ™ and EQ°°. 

(i) EQ™ is sound and complete for = a on (A°°) -preterms. That is: 

°°i-£q~ pre (s) = pre (i) if and only if s = a t. 

holds for all closed preterms s and t in (A°°). 

(ii) EQ°° is sound and complete for equality between (A°°)-terms. That is: 

°V EQ - M = N if and only if M = N . 

holds for all terms M and N in (A°°). 

Proof. For statement ^ it suffices to show that, for an equation pre (s) = pre (i) 
between preterms of (A°°). dcrivability in EQ™ coincides with derivability of this equation 



in the general proof system A°° for a-equivalence between iCRS-preterms in Definition 2.3 
Given a derivation 2?°° in EQ™ , a derivation T>f° in A°° results by replacing each formula 
occurrence pre n ([xi . . .x n ]s) = pre n ([yi . . .y n ]t) by the formula occurrence {x\ . . .x n }s = 
{yi ■ ■ ■ Un}t and adding an instance of the rule for the function symbol pre at the bottom. 
Then instances of the axioms and rules (0), (@), (A), and (S) in 2?°° correspond to 
instances of axioms and rules (0), (app), ([]), and (S) in 2?f, respectively. This proof 
transformation also has an inverse. 

For "<=" in statement (jTiJ) it suffices to note: Every scope + -delimiting strategy on Reg + 
can be used to stepwisely extend finite derivations in EQ°° with conclusion ()M = ()M 
by one additional rule application above a leaf containing a formula (Xy)N = (Xy)N 
that is not an axiom 0, which implies that (Ay) TV is not a normal form of -^ r cg+- If these 
extensions are carried out in a fair manner by extending all non-axiom leafs at depth n 
in the prooftree before proceeding with leafs at depth > n, then in the limit a completed 
derivation in EQ°° is obtained. 

For "=>" in statement (JiT]> , suppose that 2?°° is a completed derivation in EQ°° with 
conclusion ()M = ()N. Let pre (s) and pre (i) be preterm representatives of ()M and 
()N, respectively. Now a completed derivation 2?^ cr in EQ™ can be found by developing 
it stepwisely from the conclusion pre (s) = pre (i) upwards, parallel to 2?°°, and following 
the rules of EQ™ , which are invertible (that is, the premise/s of a rule instance is/are 
uniquely determined by the conclusion). Then 2?^ cr is a preterm representative version 
of 2?°°. By using (BJ, it follows that s = a t. Since s and t are preterm representatives of M 
and N, respectively, M = N follows. □ 

Definition 6.16 (the proof system Reg*). The natural-deduction-style proof system 
Reg* for equality of strongly regular, infinite A-terms has all the rules of the proof system 



EQ°° from Definition 6.14 and Figure 16 and additionally, the rule (FIX) in Figure 17 But 
contrary to the definition in EQ°°, provability of an equation (Xx)M = (Xx)N in Reg* 
is defined as the existence of a finite closed derivation with conclusion (Xx)M = (Ax) TV. 

Theorem 6.17. Reg* is sound and complete for equality between strongly regular, 
infinite A-terms. That is, for all strongly regular, infinite A-terms M and N it holds: 

i-Regi M = N if and only if M = N. 
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[(Xx)M = (Xy)Nr 

(Xx)M = (Xy)N 
(Xx)M = (Xy)N 



FJX,u (if |X>o| > 1) 



Fig. 17. The rule (FIX), which is added to the rules of EQ°° from Figure 16 
order to obtain the proof system Reg* for equality of strongly regular infinite 
A-terms. 



! [ ( A./ ) cr r | J. , „ i \(X.r)c f | }. , „ 

T>i V n+1 

{ (^)A;[/ : = 2f] }j=l,...,n (A5)L„+i[/:=C / ] 

FIXletrec, til , . . . , U„ 

(Ax) letrec fx = Lx .../« = L n in L n+1 

where c/ x , . . . , c/ n are distinct constants fresh for Li, . . . , L n+ \ , and substitutions 
L;[/ := cj] stands short for Li[f 1 := c/j, . . . , /„ := C/„]. 

side- conditions: \y\ > |S| holds for the prefix length of every (Xy)N on a thread 
in Dj for 1 < j < n+ 1 from an open assumptions ((Xx)Cf i ) Ui downwards; 
for bottommost instances: the arising derivation is guarded on access path cycles 



Fig. 18. The rule (FIX| etrec ) for the natural-deduction style proof systems Reg| etrec 
and Reg^ trec on A| e trec-terms. 



Sketch of the Proof. Let M and N be strongly regular, infinite A-terms. In view of 
Proposition 6.15| it suffices to show: 



M = N if and only if °V Ecr M = N. (6.8) 



For showing "<^" in ( [678] ), let 2?°° be a derivation in EQ°° with conclusion ()M = ()N. 
Since paths in 2?°° correspond to -> reg + -rewrite sequences, and since the number of 
generated subterms of both M and TV are finite (as a consequence of their strong 
regularity), on every infinite thread equation repetitions occur. These repetitions can be 
used to cut all infinite threads by appropriate introductions of instances of (FIX) in order 
to obtain a finite and closed derivation in Reg* with the same conclusion. 



For showing "=>■" in (6.8), let T> be a closed derivation in Reg* with conclusion 
()M = ()iV. Now T> can be unfolded into an infinite derivation 2?°° in EQ°° by repeatedly 
removing a bottommost instance of (FIX) and inserting its immediate subderivation above 
each of the marked assumptions the instance discharges. If this process is organised in a 
fair manner with respect to bottommost instances of (FIX), then in the limit an infinite 
completed prooftree with conclusion ()M = () N in EQ°° is obtained. For productivity 
of this process it is decisive that the side-condition on every instance u of the rule (FIX) 
guarantees that on threads from the conclusion of i to a marked assumption that is 
discharged by t at least one instance of a rule different from (FIX) is passed. □ 
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For the purpose of the following definition and the respective proof system in Figure 18 
we extend the signature £;v letr( , c °f ^- e gietrec an d Regfetrec by an infinite set of constants for 
which we use the symbol c as syntactical variables which frequently carry index subscripts. 

Definition 6.18 (proof systems Reg letrec , Regf etrec , and ug-Reg letrec , ug-Regf etrec ) . 

The proof systems Regj^ trec and Reg| etrec for A| etre c-terms arise from the proof systems 



Reg + and Reg (see Definition 6.3 Figure 12 and Figure 13), respectively, by replacing the 



terms in the axioms (0), and rules (A), (@), (S) and (Vacuous) through A| etre c-terms with 
abstraction prefixes accordingly, and by replacing the rule (FIX) with the rule (FIX| etrec ) 
in Figure [18] The side-condition concerning access path cycles on the derivation arising 
by an instance of (FIX| etre c) pertains only to bottommost occurrences of this rule, and is 
explained below. By (FIX| etrec ) we mean the variant of the rule (FIX| etrec ) in which the 
side-condition concerning guardedness of the arising derivation on access path cycles has 
been dropped. By ug-Regi^^/ug-Reg^^ we denote the variants of Reg^^/Reg^,-, 
respectively, in which the rule (FIX| etre c) is replaced by the rule (FIX^^). 

Let T> be a derivation in one of these proof systems. By an access path of T> we mean a 
(possibly cyclic) path 7r in T> such that: 

(a) 7r starts at the conclusion and can proceed in upwards direction; 

(b) at instances of (@), tt can step from the conclusion to one of the premises; 

(c) at instances of (FLX| etrec ), 7r can step from the conclusion to the rightmost premise 
(which corresponds to the in-part of the letrec-term that is parsed by this instance); 

(d) when arriving at a marked assumption ((A5)c/ i ) Iti that is discharged at an application 



of (FIX| e t r ec) of the form as displayed in Figure 18 tt can step over to the conclusion 
(Xx)Li[f := cj] of the subderivation T>i of that application of (FIX| etrec ), and proceed 
from there, again in upwards direction. 

For every formula occurrence o in T>, by a relative access path from o we mean a path 
with the properties |b|-([d| that starts at o and proceeds in upwards direction. An access 
path (or relative access path) in T> is cyclic if there is a formula occurrence in D that is 
visited more than once. 

Now we say that T> is guarded on access path cycles if every cyclic access path contains, 
on each of its cycles, at least one guard, that is, an instance of a rule (A) or (@). We say 
that T> is guarded if every relative access path contains a guard on each of its cycles. 

Example 6.19. The A| etrec -tcrm L\ = letrec / = /, g = Xx.x in Xy.g admits the following 
closed derivation 2?i in Reg| etrec /Reg^ trec : 



U-2 



-tt i- Vacuous/S 



(Xx)x (Xy)c a ^ 



(() C/ r Q*x.x ()Xy.c s 



FIX| etrec ,ui,u 2 



() letrec / = /, g = Xx.x in Xy.g 

This derivation can be built in a straightforward way, from the bottom upwards. Note 
that T> is guarded on access path cycles, and hence that the instance of (FIX| etrec ) at the 
bottom is a valid one, because: T> does not possess any cyclic access paths. In particular, 
there is no access path in T>i that reaches the first premise of the instance of (FIX| etrec ): 
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this premise is the starting point of an unguarded relative access path, which entails that 
T>i itself is not guarded. 

Now consider the A| e trec-term L2 = letrec / = /, g = Xx.x in Xy.f g. When trying to con- 
struct a derivation in Reg| etrec /Regj^ trec for this term in a bottom-up manner, one 
arrives at the closed derivation T>2 in ug-Reg| etrec /ug-Reg^ trec : 



"* '' "* Vacuous/S ^ ^- Vacuous/S 



(A " )c ' (A " ,C - 

(Aar)a: Qy)c/ c g 

(Qc/)" 1 QXx.x ()Ay.c/c g 

7VT7 i — i % ^ — i FIX| etrec ,ui,w 2 (but not (FIX| etrec )!) 

() letrec f = f,g= Xx.x in Xy.f g 

However, T>2 is not a valid derivation in Reg| etrec /Reg^ trec , as the inference step at the 
bottom is an instance of (FIXj^ trec ), but not of (FIX| etrec ), because the side-condition on 
the arising derivation to be guarded on access path cycles is not satisfied: now there is an 
access path that reaches the first premise of the derivation and that continues looping 
on this an unguarded cycle. Since the bottom-up search procedure for derivations is 
deterministic in this case, it follows that QI/2 is not derivable in Reg nor in Reg + . 



Similar as the correspondence, stated by Proposition 6.9, between (possibly cyclic) 
paths in a derivation in Reg and Reg + starting at the conclusion and rewrite sequences 
with respect to -* r eg and -> r cg+ on the infinite term in the conclusion, there is also the 
following correspondence between access paths in a derivation in Reg| etrec and Regj^ trec , 
and rewrite sequences with respect to ->-rog and "^rcg* on the A| etre c-term in the conclusion. 

Proposition 6.20. Let D be a derivation in Reg| etrec or ug-Reg| etrec (in Regj^ trec or in 
ug-Regj^ trec ) with conclusion (Xx)L. 

(i) Then every access path in T> to an occurrence o of a term (Xy)P corresponds to 
a ^ rcg -rewrite sequence (Xx)L -» rcg (Xy) letrec B in P (to a -^ rog + -rewrite sequence 
(Xi)L ^ rcg + (Ay) letrec B in P), where B arises as the union of all outermost binding 
groups in conclusions of instances of (FLX| e trec) below o, and (Xy)P = (Xy)P[f := c?] 
where / is comprised of the recursion variables occurring in B and c j distinct constants 
for / as chosen by V. More precisely: 

(a) a pass over an instance of (FIX| etrec ) corresponds to an empty or -* V -letrec-step, 
dependent on whether the instance is the bottommost (FIX| etrec )-instance or not; 

(b) a pass over an instance of the rule (@) to the left/to the right corresponds to a 
->@ -step/->-<a 1 -step, which, if the application is somewhere above an instance of 
(FIX| e t r ec), has to be preceded by a ^- v .@-step; 

(c) a pass over an instance of the rule (A) corresponds to a ^A-step which, if the 
application is above an instance of (FIX| etre c), has to be preceded by a ->- v .,\-step; 

(d) a pass over an instance of the rule Vacuous (S) corresponds to a ^del-step (->s-step), 
possibly preceded by an application of ^ v . ro d- 

(e) a step from a marked assumption to a premise of a (FIX| etrec )-instances, a step as 
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described in item Q of the definition of access paths, corresponds to an -> v . rcc -step 
followed by a -^ v . rc d-step. 

(ii) L is reduced if every formula occurrence in T> can be reached by an access path. 

Example 6.21. The rewrite cycle in Example |3. 19| that witnesses that the A| etre c-term 
considered there, letrec / = letrec g = f in g in /, is not unfoldable can also be recognised, 



using the statement of Proposition 6.20 from the following derivation in ug-Regj^ trec : 



FlXie-j-rec, V 

()letrecff = /in.g (c f ) u 

() letrec / = letrec g = f in gin / 



letrec 5 



Note that the instance of (FIX^ trec ) at the bottom is not an instance of (FIX| etr ec), since 
it is not guarded (has an unguarded cyclic access path that reaches and cycles on the left 
premise of the instance of (FIX| etr ec)- 

Lemma 6.22. Let V be a closed derivation in ug-Reg| etrec (in ug-Reg^ trec ) with con- 
clusion ()L. Then there exists a scope-delimiting (scope + -delimiting) strategy for 
Reg ]etrec (for Regl etrec ) with the following properties: 

(i) Every access path in T> corresponds to a rewrite sequence with respect to Su starting 



on ()L in the sense of Proposition 6.20 



(ii) Every rewrite sequence that starts on ()L and proceeds according to Sx> corresponds 
to an access path in V with correspondences as described in Proposition |6. 20 



(hi) ST^(L) = {(Ay) letrec B in P I (Ay) letrec B in P arises from an occ. of (Ag^on | 
y ' 15 v ' l v y ' I an access path of V as described m Prop. |6.20| (fl]> J 

As a consequence of that T> is finite, L is Sp-regular. 
(iv) L is Sp-productive <=> V is guarded (i.e. V derivation in Reg| etrec (Reg^ trec )). 

Proof. Given a closed derivation V with conclusion ()L (for example) in ug-Regj^ trec , 
a scope + -delimiting strategy for Regf etrec such that |r|-([iv| hold can be extracted from 
T> similar as in the proof of a scope + -delimiting strategy S-p in Reg + was extracted from a 
closed derivation in Reg + . That the extracted strategy §x> is productive/not productive 
for L if 2? is guarded/not guarded can be seen by the fact that Su-rewrite sequences 



correspond to access paths of V in the sense as stated by Proposition 6.20 □ 



Now we will prove that derivability in ug-Reg| etrec /ug-Reg^ trec is guaranteed for all 
Aietrec-terms, and that derivability in Reg| etrec /Reg|^ trec is a property of a A| e trec-term that 
is decidable by an easy parsing process. 

Proposition 6.23. The following statements hold: 

(i) For every A| e trec-term L, ()L is derivable both in ug-Reg| etrec and in ug-Reg|^ trec . 

(ii) For every Ai e trec-term L, derivability of ()L in Reg|^ trec is decidable in at most quadratic 
time in the size of L. 

Proof. For <JTJ) note that for every A| etrec -tcrm L 1 a closed derivation T>^ with conclusion 
()L in ug-Reg^ trec can be produced by a bottom- up construction following the term 
structure of L. Hereby use of the rules (S) can be restricted to instances immediately below 
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marked assumptions such that, viewed from a (non-cyclic) path tt from the conclusion 
upwards to a marked assumption, these (S)-instances are only introduced to shorten 
the frozen abstraction prefixes by all A-abstractions that have become frozen on tt (in 
order to conform to the side-condition on (FIX| etrec )-instances to have the same frozen 
abstraction prefix lengths in the discharged marked assumptions as in the conclusion and 
in the premises). 

Now for <JT±J) in order to decide derivability of QL in Reg^ trec , it suffices to decide 
whether the derivation T>l in ug-Regj^ trec obtained as described above, or its bottommost 
instance of (FIXj^ trec ) if there is any, is guarded on all of its access path cycles. (Note 
that in the construction of T>l only the freedom in placing instances of (S) has been used 
in a certain, namely lazy, way. The specific placement of instances of these rules does 
not interfere with the existence or non-existence of guards, that is instances of A or @ 
on cycles of access paths.) For this it remains to check whether every cycle on an access 
path in T>l has a guard. This can be done by exploring the prooftree of T>l according to 
all possible access paths (until for the first time a cycle is concluded) and checking for 
the existence of guards on cycles. 

□ 

We now can prove soundness and completeness of the proof system Regj^ trec for the 
property of A| e trec-terms to unfold to infinite A-terms. 

Theorem 6.24. Reg^ trec is sound and complete for the property of A| e trec-terms to unfold 
to an infinite A-term. That is, for every term L e Ter(A| etrec ) the following statements are 
equivalent: 

(i) L expresses an infinite A-term. 

(ii) ^Re g + trec QL. 

Proof. For the proof of both directions of the equivalence, let L e Ter (A| etr ec)- 
For showing the implication ^ => ^ , we argue by contraposition, and therefore assume 
that QL is not derivable in Regj^ trec . Then, while ()L is not derivable in Reg^ trec , there 



is, by Proposition 6.23 m, a derivation V in ug-Regj^ trec that is not guarded. It follows 



by Lemma 6.22 and in particular due to its item (hvj), that there is a scope + -delimiting 



strategy for Reg such that L is not Sx>-productive. Then it follows by Lemma 5.15 
using @ =>• (pv|) there, that L does not unfold to an infinite A-term. 



For showing the implication (JiTJ) =>• M, let V be a closed derivation in Reg^ trec with 



conclusion QL. It follows by Lemma [6.22 that there is a scope + -delimiting strategy S for 
Reg such that L is S-productive. Then Lemma |5.15| implies that L unfolds to an infinite 
A-term. □ 

Remark 6.25 (soundness and completeness for Reg| etrec ). Also the proof system 
Reg| etrec can be shown to be sound and complete for the property of A| e trec-terms to 
unfold to infinite A-terms. To establish this in analogy with the route of proof we pursued 



here, a CRS Parse similar to Parse* (see Definition 4.301 could be defined by replacing 
the rule (gp arse +) by a rule (£>pa,! se +) that can compress more abstraction prefixes, similar 
as the rule (f?reg) °f R e 9 can compress more abstraction prefixes than the rule (f?r e g+) 
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of Reg* . Then furthermore also a lemma analogous to Lemma 5.15 can be formulated, 
proved, and used in a similar way. 

We now arrive at a theorem that states one direction of our main characterisation 
result (Theorem 8.12 in Section [8| that will link A| e trec-expressibility to strong regularity 
of infinite A-terms. 

Theorem 6.26. Every A| e t re c-expressible, infinite A-term is strongly regular. 

Proof. Let M be an infinite A-term that is expressible by a A| e trec-term L, that is, 
L M holds. Then by Theorem 
conclusion ()L. Now Lemma 



6.22 



6.24 



such that L is Sp-regular. Then Lemma 



Reg 



letrec 



such that M 



there exists a closed derivation T> in Regj^ trec with 
guarantees a scope + -delimiting strategy Sx> for Regl etrec 
l gives an scope + -dclimiting strategy §x> for 
U(L) is §>x)-regular. It follows that M is strongly regular. □ 



7. Binding Capturing Chains 

In this section we develop a characterisation for strong regularity of an infinite A-term by 
means of a property of the 'binding-capturing chains' occuring in the term. This concept is 
related to the notions of scope and scope + as explained informally at the start of Section [4] 
Binding-capturing chains occur whenever scopes overlap, and they are contained within 
scopes + . First we give definitions for the concepts involved: binding, capturing, and bin- 
ding-capturing chains. Then we show that strong regularity is equivalent to the absence 
of infinite binding-capturing chains. 

We will define binding and capturing as relations on the positions of a A-term. Binding 
relates an abstraction with the occurrences of the variable it binds. If p is the position 
of an abstraction (Ax.. . .) that abstracts over x and q is the position of an occurrence 
of x that is bound by the abstraction, then we will write p °— q and say that p 'binds' q. 
Capturing relates an abstraction with variable occurrences that are free in it. If p is the 
position of an abstraction, and q > p is the position of a variable that is free in the entire 



subterm at position p, then we will write p *- q and say that p 'captures' q. See Figure 19 
for an illustration of these concepts. 

When we speak of positions in A-terms (and thus iCRS-terms) we act on the assumption 



that positions on iCRS-terms are an established concept as for example in (Ketema and 



Simonsen, 2011). Note, however, that we deviate slightly from the scheme there in 



addressing the arguments of an app by and 1 instead of 1 and 2. 



Binding-capturing chains have been used in (Endrullis et al., 2011) to study a-avoiding 



rewrite sequences in a rewrite calculus for /i-unfolding. They originate from the notion 



of 'gripping' due to (Mellies, 1996), and from techniques developed in (Oostrom, 1997) 
concerning the notion of 'holding' of redexes (which is shown there as being 'parting' for 
CRSs, that is, never relating two residuals of the same redex). 

We now define 'binding' and 'capturing' formally as binary relations on the set of 
positions of infinite A-terms. 

Definition 7.1 (binding, capturing). For every M e Ter(\°°) we define the binary 
relations °— and -* on the set Pos(M) of positions of M: 
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Fig. 19. The term graph from Example 1 1 . 1 1 with its overlapping scopes (left), its 
nested scope + s (middle), and with indicated binding <>— and capturing -> links 
(right). 



(i) We say that a binder at position p binds a variable occurrence at position q, symbolically 
p °— q, if p is a binder position, and q a variable position in M, and the binder at 
position p binds the variable occurrence at position q. 

(ii) We say that a variable occurrence at position q is captured by a binder at position p, 
symbolically q -* p (and that a binder at position p captures a variable occurrence at 
position q, symbolically p *- q), if q is a variable position and p < q a binder position 
in M, and there is no binder position go in M with p < qa and qo °— q. 

Definition 7.2 (binding— capturing chain). Let M be an infinite A-term. A finite or 
infinite sequence (po,pi,p2, ■ ■ ■) in Pos(M) is called a binding- capturing chain in M if po 
is the position of an abstraction in M, and the positions in the sequence are alternatingly 
linked via binding and capturing, starting with a binding: pq ^- p\ -* pi <>- . . .. 

Binding-capturing chains are closely related to the notion of scope and scope + . In order 
to establish this, we first give precise definitions of the notions of scope and scope + in 
terms of an 'in-scope' rewrite relation on the positions of a A-term: While the scope of 
a binder position p is the set of positions between p and variable positions bound at p 
(the positions directly reachable by a single 'in-scope' step) , the scope + of p is the set of 
positions reachable by a finite number of successive 'in-scope' steps. 

Definition 7.3 (scope and scope + ). Let M be an infinite A-term. On the set Pos(M) 
of M, the in-scope rewrite relation -» sc (for M) is defined by: 

( p a binder position 1 
P^scQ \ A / q , cP = , A . .,\ (for all p,q£ Pos(M)) 

[ a (Bp e Pos(M)) p °— p a p < q < p ) 

where <^ = denotes the reflexive closure of the binding relation ^- (for M). For every 
position p e Pos(M), the scope of p in M and the scope + of p in M are defined as the 
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following sets of positions in M: 

scope M (p) := {q e Pos(M) \ p ^ sc q} scope + M (p) := {q e Pos(M) \ p -^ + sc q} 

(Note that the scopes and scope + s of non-binder positions are empty sets of positions.) 

Now the following proposition establishes that binding-capturing chains starting at a 
binder position p span the space of positions of the scope + of p. 

Proposition 7.4. Let M be an infinite A-term. Then for all positions p, q € Pos(M) the 
following statements hold: 

(i) p -+ sc q A q is a binder position <=>• (p = q binder position) vp<^--»g. 

(ii) p 1 (p binder pos.) a (3p' e Pos{M))p (°— ■-*)*■ <^ = p' a p < q < p' . 

(iii) scope M {p) := {q e Pos(M) | (p binder pos.) A (3p r e Pos(M)) p <^ = p' A p<q<p'}. 

(iv) scope + M (p) := {g e Po S (M) | (^p^S))^- ■ ->)* ■ ^ p' a p < q < p'Y 

Conversely, positions between a binder position p and a position p n on a binding-cap- 
turing chain starting at p n are in the scope + of p n . 

Proposition 7.5. Let (po,Pi,P2, • ■ ■) be a binding-capt. chain in an infinite A-term M. 
Then it holds that p a < p 2 < ■ ■ ., and p a < pi, p 2 < Ps, .... Furthermore, for all q such that 
Po < q< p n for some n € N with p n a position on the chain it holds that q e scope + M {p ) . 

In order to study the relationship between rewrite sequences in Reg + and binding-cap- 
turing chains we first introduce a position-annotated variant of Reg + . 

Here the idea is that, when a prefixed term (Xyi ...y n )N is obtained as a generated 
subterm of an infinite A-term M by a -^ rc g or -^ rcg + rewrite sequence r on ()M, then in 
the position-annotated rewrite system a prefixed term (Ayi, . . . ,y n ) Pl Pn N is obtained 
by an annotated version r pos of the rewrite sequence r such that: the positions p\, . . . ,p n 
are the positions in (the original A-term) M from which the bindings Aj/i , . . . , Xy n in 
the abstraction prefix descend, and q is the position in M of the body TV of the subterm 
generated by r. 

On Ter((A°°)) we consider the following rewrite rules in informal notation: 

(Azi . . . x n ) q pi jpn M M l - (Azi . . . *„)£,...,„„ M, (i e {0, 1}) 

(Aa;i . . .i„)^ pn Ay.M -»■ (Azi . . . x n y) q pi> pnq M Q 
(Xx 1 . . . x n+1 ) q pi iPn+i M -»• (Axi . . . x n ) q pu ^ pn M 

(if the binding Ax„ + i is vacuous) 
(Azi . . . „ Pn+1 M -+ (Azi . . . x^ x x i+1 . . . x n+1 ) q pu ... Pi _ uPM ... Pn+1 M 

(if the binding Axj is vacuous) 

Note that the change of the term-body position in a A-decomposition step is motivated by 
the underlying CRS-notation for terms in (A°°): when a term abs([y]M ) representing 
a A-abstraction starts at position q, then its binding is declared at position qO, and its 
body M starts at position qOO. 



\@pos) 
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Definition 7.6 (position-annotated variants Reg pos and Reg pos ). The CRS-sig- 
nature for (A) pos , the A-calculus with position-annotated abstraction prefixes is given 
by S(a) pos = S A u {pre^ pi )J)n> | pi,... ,p n ,qt {0, 1}*} where all of the function symbols 
p> re (pi p , are unary. We consider the following CRS-rules over S(^) poa : 





K „>(>•'••• 


a: n ]app(M ,M 1 )) - 


qi 


• >P« 


} ([xi.. 


■X n ]Mi) 


(epos) : 


P re ( pl ,...,p„)(^l- 


.a:„]abs([y]Mo)) - 


gOO 

* pre L. 


-,P„ 




■ ■x n y]M ) 


(p S p OS ) : 


q 

P re {pl,...,p„ + 1 


([xi . . .x n+ i]M ) - 




■,Pr 


} ([xi.. 


.x n ]M ) 


/ „dcl \ , 


pre? 


([xi . ..ac n+1 ]Mo) - 














q 

P re (pi,...pi-i,Pi+i...p 7 




. X 


i-l%i+l ■ 


..x n+ i]M ] 



By Reg' pos we denote the CRS with the rules £Cj s and g pos . By Reg pos (Reg pos ) we denote 
the CRS consisting of all the above rules except the rule g p f s (g pos )- 

By Reg pos , Reg pos and Reg pos we denote the infinite abstract rewriting systems (ARSs) 
induced by the iCRSs derived from Reg~ pos , Reg pos , Reg + , restricted to position-annotated 
terms in Ter((A°°)). 

By drop pos we denote an operation that drops the position annotations in CRS-terms. 

Proposition 7.7. The following two statements hold: 

(i) Every rewrite sequence: 

t : (Xxq)Mo ->> (A5i)Mi ->> ... -+ T (Xx n ) M n (7.9) 

(with r e {reg~, reg, reg + }) in Reg~ , Reg, or Reg + can stepwisely be transformed (lifted), 
for given qo e N* and po e N* with |po| = \%o\, by adding these and appropriate further 
position annotations qi, . . . , q n e N* and pi, . . . ,p n £ N*, to a rewrite sequence: 

r pos : (Xx )f M Q - P (Xxi)^ Mi - r . . . - P (A2„)£ M n (7.10) 

(with r e {reg - , reg, reg + }) in Reg pos , Reg pos , or Reg pos , accordingly, such that the 
result of dropping the position annotations in the prefix of f is again t. 

(ii) Conversely, every rewrite sequence £ in Reg pos , Reg pos , or Reg pos of the form (7.101 
(with r e {reg - , reg, reg + }) can stepwisely be transformed, by dropping the posi- 
tion annotations in the prefix, to a rewrite sequence £ of the form (7.10) (with 
r e {reg~, reg, reg + }) in Reg' , Reg, or Reg + , respectively. 

The transformations in ^ and (JTTJ) preserve eagerness/laziness of rewrite sequences. 

As a direct consequence we obtain, for the eager and lazy scope-delimiting (scope + - 
dclimiting) strategies, the following direct correspondence between generated subterms in 
the ARS Reg (Reg + ) and in the position-annotated version Reg pos (Reg pos ). 

Proposition 7.8. For all infinite A-terms M it holds: 

(i) ST S (M) = drop pos (ST pos (M)) for the strategies § e {§ cag , §i azy } on Reg, Reg pos . 

(ii) ST + S (M) = drop pos (ST + s ' pos (M)) for the strategies § e {S c + ag , §+ zy } on Reg, Reg pos . 

The proposition below characterizes the binding relation <>— and the capturing relation 
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-> on the positions of an infinite term M with the help of rewrite sequences with respect 
to -> re g- on () e M in Reg' pos down to 'variable occurrences' (Xx)pXi in M. 

Proposition 7.9. For all M e Ter(A°°) and positions p, q e Pos(M) it holds: 

p o— q <=>■ there is a rewrite sequence ()^ M -»- re g- (A:Ei • ■ ■ in)^ p #i 
such that p = Pi 

q -* p <=>• there is a rewrite sequence ()M -»- xe g- (Aa^i . . . x n ) q pi p %i 
such that p e {p l+ i, . . . ,p n ) 

The following lemmas describe the close relationship between, on the one hand, bin- 
ding-capturing chains in an infinite A-term M, and on the other hand, -> rcg + -rewrite 
sequences on () £ M in Reg* os that are guided by the eager scope + -delimiting strategy. 

Lemma 7.10 (binding capturing chains). For all M e Ter°°(A) it holds: 

(i) If () e M -»sj ag (Axi . . . x n ) pi Pn N in Reg* , then there exist q 2 , . . . , q n e Pos(M) such 
that pi °- q 2 -* P2 °- ■ • ■ °- q n -* Pn- 

(ii) If pi °— qi -* pi °— . . . °— q n -* p n is a binding-capturing chain in M, then there exist 
ri, . . . ,r n ,s e Pos(M) such that () e M -»s+ a (Xxi . . . x n ) s ri r ^N and furthermore 
Pl,... ,p n 6 {n, •• -,»"„} such that pi < p 2 < . . . < p n - 

Lemma 7.11. Let M be an infinite A-term such that ()M -»sj ag (Axi . . .x n )N. Then 
M contains a binding-capturing chain of length n — 1. 



Proof. By Proposition 7.7 O, the assumed rewrite sequence ()M -»s+^ (Axi . . .x n )N 
in i?e<7 can be lifted to a rewrite sequence () e M ^sj ag (Azi ...x n ) pi N in Reg pos . 
Then by Lemma 7.10 (|i|, there exists a binding-capturing chain of length n—1. D 

The notion of scope and scope + helps to understand the relationship between binding- 
capturing chains and rewrite sequences in Reg* . A binding-capturing chain corresponds 
to the overlap of scopes, or in other words the nesting of scope + s. An infinite binding- 
capturing chain thus corresponds to a infinitely deep nesting of scope + s and therefore to 
an unrestricted growth of the prefix in certain rewriting sequences in Reg* . 

Lemma 7.12 (infinite binding— capturing chains). Let M be an infinite A-term, and 
let r be an infinite rewrite sequence in Reg w.r.t. the eager scope + -delimiting strategy S* ag : 

t : ()M = (Xx )M - Siag (Xxi)Mi -+ Siag . . . -s^ (Xi^M, -g^ . . . (7.11) 

Furthermore suppose that for p ; N -*■ N, i >-* p(i) := \xj\, the prefix length function 
associated with r, there exists a lower bound lb ; N -> N such that I is non-decreasing, 
and limn^oo lb(n) = oo. Then there exists an infinite binding-capturing chain in M. 

Proof. Let M, r, p, lb as in the assumption of the lemma. We first note that by 
Proposition |7.7| (m), the rewrite sequence r can be lifted to one with position annotations: 



rP° s : () e M = (XxoYM - Scag (Xx^M, -+ Seag . . . - Scag {Xx^M, - Soag . . . (7.12) 
where, for alH € N, qi are positions and pi = (pi , . . . , p mi ) vectors of positions, with m, € N. 
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Next we define the function: 

st : N -> N, !h st(Z) := min {i \ lb(i) > 1} 

which is well-defined, since limn-too lb(n) = oo. It describes a prefix stabilization property: 
for every I e N, it gives the first index i = st(l) with the property that the prefix 
of (Xii)Mi contains more than I abstractions, and (since lb is non-decreasing, and a 
lower bound for p) that from i onwards the Z-th abstraction never disappears again, 
for j > i, in terms (Xxj)Mj that follow in t as well as in r pos . Furthermore, st is non- 
decreasing, as an easy consequence of its definition, and unbounded: if st were bounded 
by M € N, then VZ eN3i eN. i < M a lb(i) > I would follow, which cannot be the case 
since {lb(Q), . . . , lb(M)} is a finite set. By non-decreasingness and unboundedness it also 
follows that lim^oo st{n) = oo. 

So when the rewrite sequence r pos is split into segments indicated in: 

(TM -^ Scag . . . -^ §cag (A2 rt( i))£jJ M sKi) -^ Scag {Xx st{l+1) )f^M st{l+l) ^ Scag . . . 

then it follows that all terms of the sequence after (Ax si (i))! ,t(!) M s t(j,) have an abstraction 
prefix of length greater or equal to i, for all i e N. 

Now note that in a step (Xx)^ pi p ^P -* (Xx')^ p , p , ^P' in Reg pos that does not 
shorten the abstraction prefix it holds that n < n' and p± = p%, . . . , p' n = p n , that is, 
positions in the vector in the subscript of the abstraction prefix are preserved. As a 
consequence it follows for the rewrite sequence r pos that, for all i € N and j > i, the 
position vector p s t(j) in the term (A2 s t(j))^ s< ^ M st ^ is of the following form: 

Pst(j) = \Pl,st{i)i ■ ■ ■ iPi,at{i)iPj,st{j)i ■ ■ ■ TPj.rrij) 

This implies furthermore that for all i e N: 

Pst(i) = (Pl,st(l))P2,st(2))- ■ ■ ,Pi,st(i),- ■ ■ ,Pi,m,i) 



Then Lemma 7.10 dih , implies the existence of positions <72,93, ■ ■ • such that: 



PX,ai(l) ^ 12 -* P2,st(2) o-q3-+ .-.-* Pi,st(i) ^ f?i+l "* Pt+l,at(t+l) °- • • • ■ 

and thereby, an infinite binding-capturing chain in M. □ 

Now we formulate and prove the main theorem of this section, which applies the concept 
of binding-capturing chain to pin down, among all infinite A-terms that are regular, those 
that are strongly regular. 

Theorem 7.13. A regular infinite A-term is strongly regular if and only if it contains 
only finite binding-capturing chains. 



By adding the statement of Proposition 4.22 (HI), we obtain the following accentuation 



Corollary 7.14. A strongly regular A-term is regular, and contains only finite binding- 
capturing chains. Conversely, a regular A-term is strongly regular if it contains only finite 
binding-capturing chains. 
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Proof of Theorem 7.13 . Let M be an infinite A-term that is regular. 
For showing the implication "=>-", we assume that M is also strongly regular. Then there 
exists a scope + -delimiting strategy S such that STg(M) is finite. By Proposition 



4.23 



i|) it follows that then also 5T§ + (M) is finite for the eager scope + -delimiting strategy 
in Reg + . Now let n be the longest abstraction prefix of a term in STg+ (M). Then it 
follows by Lemma [T. 1 1 1 that the length of every binding-capturing chain in M is bounded 
by n - 1. Hence M only contains finite binding-capturing chains. 

In the rest of this proof, we establish the implication u <=" in the statement of the 
theorem. For this we argue indirectly: assuming that M is not strongly regular, we show 
the existence of an infinite binding-capturing chain in M. 

So suppose that M is not strongly regular. Then for all scope + -delimiting strategies S 
in Reg + it holds that STt(M) is infinite. This means that in particular STt+ (M) is 

cag 

infinite for the eager scope-delimiting strategy on R e 9 + ■ It follows that the number 
of -»-g+ a -reducts, and hence the induced sub-ARS (()M-»s+ ) of ()M in Reg + is infinite. 
Since -»•§+ on Reg + has branching degree < 2 (branching actually only happens at sources 
of -steps), it follows by Konig's Lemma that there exists an infinite rewrite sequence: 

r : ()M = (Xx )M -s+ (ASi) Mi -s+ ■ • ■ -+s+ (A2,)M< -s+ ■ • • 



in Reg that passes through distinct terms. By Lemma 4.12 (lij), this rewrite sequence 
projects to a rewrite sequence: 

f : ()M = (Xx' )M Q -^ Soag (Axi)Mi ^ Scag . . . ^ Soag (Xx^Mi -^ §oag . . . (7.13) 

in Reg in the sense that: 

(Axi)Mi -» dd (A2j)Af< (for all * e N) ; (7.14) 

note that, in the terms, the projection merely shortens the length of the abstraction prefix. 
Since M is regular, 5'T§ cag (M) is finite by Proposition 4.23 (jlj), and hence it follows that 
only finitely many terms occur in f. 

Now we will use this contrast with r, and the fact that the terms of r project to terms 
in f via -»dcrP r efi x compression rewrite sequences, to show that the prefix lengths in 
terms of r are unbounded, and stronger still, that these lengths actually tend to infinity. 
More precisely, we show the following: 

(VZ e N)(3i e N) (Vi > i ) [ > I] . (7.15) 

Suppose that this statement does not hold. Then there exists lo e N such that < Iq for 
infinitely many i € N. This means that there is an increasing sequence io < i\ < ii < 13 < ■ ■ ■ 
in N such that: 

S := {{\x %] )M %] I j e N} is infinite (7.16) 
for all (A.r, e S : \x %] \ < l (7.17) 

(S is infinite since the terms on r are distinct). On the other hand we have: 

T := {{Xx'^M^ I j e N} c ST Sci jM) is finite (7.18) 

because M is regular. However, since every term in S has a -»der r educt in T due to (7.14), 
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as well as an abstraction prefix of a length bounded by Iq, it follows by Proposition 4.9 
jiij, that S also has to be finite, conflicting with (7.161. We have reached a contradiction, 
and thereby established (7.15). 



Now we are able to define a lower bound on the lengths of the prefixes in r that fulfills 
the requirements of Lemma |7.12| We define the function: 

lb : N -> N, n« lb(n) := min{|x„/| | n' > n} 

Its definition guarantees that lb is a lower bound on the prefix lengths in t, and that lb 
is non-decreasing. Furthermore also lim^oo Z6(n) = oo follows by non-decreasingness, in 



addition to unboundedness of lb: for arbitrary I e N, by (7.15) there exists n € N such 
that > I holds for all n € N, n > n ; this entails lb(n ) > I. 

Now since M, t, together with as defined above, satisfy the assumptions of Lemma |7.12| 
this lemma can be applied, yielding an infinite binding-capturing chain in M. □ 

Example 7.15. The infinite A-term from Example |1. 1| with a representation as a higher- 
order recursive program scheme in Example |4.25[ which was recognised there to be regular 
but not strongly regular, possesses an infinite binding-capturing chain as indicated on 
the right in Figure [l9| 



8. Expressibility by terms of the A-calculus with letrec 

In this section we finish the proof of our main characterisation result: we prove that every 
strongly regular A-term is A| e t re c-expressible. For this purpose we introduce an annotated 
variant of one of the proof systems for strongly regular infinite A-terms. We show that 
every closed derivation in Regg with conclusion ()M, which witnesses that M is strongly 
regular, can be annotated, by adding appropriate A| etrec -tcrms to each prefixed term in the 
derivation, into a derivation in the annotated system with conclusion () L: M such that 
the Aietrec-term annotation L expresses the infinite A-term M. We show the correctness of 
this construction by transforming the derivation in the annotated proof system into a 
derivation in the proof system Reg* with conclusion ()U(L) = ()M, and then drawing 
upon the soundness of Reg* with respect to equality of strongly regular infinite A-terms. 

We start by introducing a variant of the proof system Regg in which the formulas are 
closed, prefixed, A| e trec-term-annotatcd, infinite A-terms. 

Definition 8.1 (the proof system ann-Regg). The formulas of the proof system 
ann-Regg are closed expressions of the form (Ai) L : M with x a variable prefix vector, 
Xx.L a A]etrec-term, and Xx.M a A-term. The axioms and rules of ann-Regg are annotated 



12 



versions of the axioms and rules of the proof system Regg from Definition 6.3 and Figure 
and are displayed in Figure [20| 

Remark 8.2. As for an example that illustrates why we have chosen to formulate an 
annotated version only of the proof system Regg, but not of Reg + , please see Example 



.5 



The following proposition is a statement that is entirely analogous to Proposition |6.5| 
Proposition 8.3. For all for all instances L of the rule (FIX) in a derivation T> (possibly 
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(Xxy) y.y , , , „ X 



(Xxy) L : M (AS) L : M (Xx) Li : Mi 



(Xx) Xy.L: Xy.M (Xx) Lo Li : Mo Mi 

(Axi . . .x„-i) L : M 



(Xxi ...x n ) L: M 

[(Xx) c u : Mf 
V 



S (if the binding Xx n is vacuous) 



(Xx) L[u := c u ] : M FIX u (if |f o| > 1, and \y\ > \x\ for all (Xy)N on threads 



(AS) (letrec w = L in u) : M ' from open assumptions ((Xx) u : M) u down) 



Fig. 20. Annotated natural-deduction style proof system ann-Regg for strongly 
regular infinite A-terms, a version of Regg with A| etre c-terms as annotations. 



with open assumptions) in ann-RegQ it holds: every thread from i upwards to a marked 
assumption that is discharged at i passes at least one instance of a rule (A) or (&). 

The lemma below states a straightforward connection between derivations in Regg 
and derivations in its annotated version ann-Regg. 

Lemma 8.4 (from Regg- to ann-Regg-derivations, and back). The following trans- 
formations are possible between derivations in Regg and derivations in ann-Regg: 

(i) Every derivation V in Regg with conclusion (Xx)M can be transformed into a 
derivation T> in ann-Regg with conclusion (Xx) L : M such that there is a bijective 
correspondence between marked assumptions ((Xy) M) u in T> and marked assumptions 
((Xy) u: M) u in T>. (As a consequence, T> is a closed derivation if V is closed.) More 
precisely, T> can be obtained from T> by replacing every term occurrence (Xy)N by 
an occurrence of (Xy) P : N for a prefixed A| etrec -term (Xy)P with the property that 
every prefix variable bound in P is also bound in N . Thereby occurrences of marked 
assumptions and axioms in T> give rise to occurrences of marked assumptions and 
axioms in T>, respectively; instances of the Regg-rules A, @, S, and (FIX) in T> give 
rise to instances of ann-Regg-rules A, @, S, and (FIX) in T>, respectively. 

(ii) From every closed derivation T> in ann-Regg with conclusion (Xx) L : M a closed 
derivation T) in Reg with conclusion (Xx)M can be obtained by dropping the 
annotations with A| etre c-terms. 

Proof. Statement Q of the lemma can be established through a proof by induction 
on the depth \D\ of a derivation T> in Regg with possibly open assumptions. In the 
base case, axioms (0) of Regg are annotated to axioms (0) of ann-Regg, and marked 
assumptions ((Xy)N) u in Regg to marked assumptions ((Xy) c u : N) u . In the induction 
step it has to be shown that a derivation T> in Regg with immediate subderivation 2? 
can be annotated to a derivation 2?( lb ) in ann-Regg, using the induction hypothesis 
which guarantees that an annotated version T> of T> has already been obtained. Then 
for obtaining T> from T> the fact is used that the rules in ann-Regg uniquely determine 
the annotation in the conclusion of an instance once the annotation(s) in the prcmise(s) 
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(and in the case of (FIX) additionally the annotation markers used in the assumptions 
that are discharged) are given. In order to establish that instances of S in V give rise to 
corresponding instances of S in T>, the part of the induction hypothesis is used which 
guarantees that the A| e trec-term annotation in the premise contains not more variable 
bindings than the infinite A-term it annotates. 

Statement <JTTJ> of the lemma is a consequence of the fact that, by dropping the A] e trec-term- 
annotations, every instance of a rule of ann-Regg give rise to an instance of the corre- 
sponding rule in Regg. Formally the statement can again be established by induction on 
the depth of derivations in ann-Regg. □ 



Example 8.5. The der ivati on T>i in Regg from Example 6.6 on the left can be annotated 



as described by Lemma 8.4 (|iij), to obtain the following derivation T>i in ann-Regg : 



(() c M : M) u s q 

(Ax) c u ■ M (Xx) x '■ x 

(Xx) c u x : M x 



(Xxy) c u x:Mx (Xxy) y.y 

(Xxy) c u xy.Mxy 
(Xx) Xy.c u xy. M xy 
() Xxy.c u xy : Xxy.Mxy 
() (letrec u = Xxy.uxy in u) : M 



FIX, u 



Note that the term in the conclusion, which has been extracted by the annotation 
procedure, is actually the same as the A| e t re c-term letrec / = Xxy.fyx in / which was used 



in Example 6.6 to define M as its infinite unfolding. 

Furthermore note that, in a variant of ann-Regg in which the 'Regg-addition' (con- 
cerning abstrat ion p refix lengths) to the side-condition of (FIX) is dropped, the derivation 
on the right could be annotated to obtain the following prooftree T> r : 



T> r in Example 



6.6 



((Xx)c u :Xy.Mxy) u 
()Xx.C u :M 

" 



(Xx) Xx.c u : M (Xx) x •■ x 

(Xx) (Xx.c u ) x : Mi 

(Xxy) (Xx.c u ) x ■■ Mx (Xxy) y.y 

(Xxy) (Xx.c u )xy-Mxy 



A 



(Xx) Xy.(Xx.c u )xy.Xy.Mxy ^ ^ ( no instance of (FIX) 
(Xx) letrecu = Xy.uxy in u: Xy.Mxy ' in ann-Reg !) 



() Ax. letrec u = Xy.uxy in u: M 



Observe that, equally as was the case for T> r , also in t> r there occurs, on the thread 
between the marked assumption at the top and the rule instance l at which this assumption 
is discharged, a formula, namely () u- M, that has a shorter abstraction prefix than the 
formula in the premise and conclusion of t as well as in the assumption. Thus i is not an 
instance of the rule (FIX) in ann-Regg. 

Furthermore note that the A| etre c-term extracted by T> r does not unfold to M, and hence 
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{ [(AS) C A : Mi] }*=!,...,„ { [(Ax) c A : M*] }«,...,„ 

{ (AS) [/:=£/] -.Mj }j=i,...,t. (AS) L„ + i [/:=£/] : M„+i 

(AS) (letrec f 1 = L 1 ...f„ = L n in L„+i) : M n+ i 



FIX, 



where , . . . , Cf ra are distinct constants fresh for L\, . . . , L n+ i, and substitutions 
Li[f := Cj\ stands short for L;[/i := c/ x , . . . , /„ := C/ n ]. 

side-conditions: \y\ > |S| holds for the prefix length of every (Xy)N on a thread 
in T>j for 1 < j < n + 1 from an open assumptions ((Ai)c/ i )" i downwards; 
for bottommost instances: the arising derivation is guarded on access path cycles. 



Fig. 21. The proof system ann-Reg^ trec for A| e t re c-terms arises from the proof 



system ann-Regg (see Figure 20 1 by replacing the rule (FIX) with the rule 

(FIX| et 

rec ) • 



does not express M. This example shows that the side-condition on instances of (FIX) in 
ann-Regg cannot be weakened to the form used for the rule (FIX) in Reg + when the 
aim is to extract a A| e trec-term that unfolds to the infinite A-term in the conclusion. 

The central property of the proof system ann-Rego still remains to be shown: that 
the Aietrec-terms in the conclusion of a derivation in this system does actually unfold to 
the infinite A-term in the conclusion. This will be established below in Lemma 18.101 and 
Theorem [811] But as an intermediary proof system that will allow us to use results about 
the proof system Regj^ trec from Section |6j we also introduce an annotated version of the 
rule letrec in Reg^ trec , and an according annotated proof system. 

Definition 8.6 (the proof system ann-Regj^ trec ). The proof system ann-Regj^ trec 
arises from RegJ by replacing the rule (FIX) by the rule (FKietrec) in Figure 



21 



annotated version of the rule (FIX| e trec) from Definition 6.18 and Figure 18 The side- 



condition on bottommost instances of (FIX| etr ec) to be guarded on access path cycles is 
analogous as explained in Definition [6T8j 

Proposition 8.7 (from ann-Regj^ trec - to Reg^ trec -derivations). Let V be a closed 
derivation in ann-Regj^ trec with conclusion ()L:M. Then a closed derivation T> in 
Reg^ trec with conclusion ()L can be obtained by removing the infinite A-terms in T> while 
keeping the A| e trec-term-annotations. 

Proposition 8.8 (from ann-Regg- to ann-Regj^ trec -derivations). Every derivation 
T> in ann-Regg can be transformed into a derivation V in ann-Regj^ trec with the same 
conclusion and with the same open assumption classes. 

Proof. First note that the ann-Regg and ann-Regj^ trec differ only by the specific 
version of assumption-discharging rule in the system, (FIX) in ann-Regg and (FIX| etrec ) 
in ann-Regj^ trec . For showing the proposition, let V be a derivation in ann-Regg. 

We define a prooftree V , (intended to be a derivation in ann-Reg|^ trec ) by repeatedly 
replacing topmost occurrences of (FIX) at the bottom of subderivations of the form as 
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depicted in Figure |20| by simulating subderivations of the form: 
[(Ax) c u : M} u 

(Xx) L[u := c„] : M {{\x)c u :MY 

— — — — — t lA|etrec, U 

{ax) (letrec u = L in u) : M 

until all occurrences of instances of (FIX) have been replaced by instances of (FIX| etr ec)- 
The result is a prooftree with axioms and rules of ann-Reg^ trec +(FIX| etrec ), with the same 
conclusion and the same classes of open assumptions as 2?, but in which rule instances 
carrying the label (FIX| etre c) might actually be instances of (FIX^,...^), unless actually 
proven (as will be done below) to be instances of (FIX| et rec)- 

Now first note that, due to the form of the introduced instances of (FIX| etrec ), every 
formula occurrence in V is reachable on an access path of V . Second, note that relative 
access paths ir' in V starting at the conclusion of an instance i' of (FIX| etrec ) up to 
a marked assumption that is discharged at i! descend from a thread 7r in T> from the 
conclusion of an application l of (FIX) up to a marked assumption that is discharged 



at l. Since by Proposition 8.3 the thread 7r' passes at least one instance of a rule (A) or 
(@), this is also the case for tt. As a consequence, all cycles on relative access paths are 
guarded. Thus V is guarded. Hence all occurrences of rule names (FIX| etrec ) in V rightly 
label occurences of this rule, and V is a derivation in ann-Reg|^ trec , which moreover is 
guarded. □ 



Example 8.9. The closed derivation T>i in Example 8.5 can be transformed into the 
following closed derivation in ann-Regj^ trec : 



(Xx) C u : M ( AX ) x '■ x 

(Ax) c u x : M x 



(Xxy) c u x:Mx (Xxy) y.y 

{Xxy) c u xy:Mxy 
(Xx) Xy.c u xy:Mxy 

A 



()Xxy.c u xy-Xxy.Mxy (()c u :M) u 

; — — t lA| etrec , u 

() (letrec u = Xxy.uxy in u) : M 

Now we concentrate on the remaining matter of proving that the A| e trec-term obtained 
by the annotation process from a closed derivation in Regg to one in ann-Rego docs 
indeed unfold to the infinite A-term it annotates. For this, we establish a proof-theoretic 
transformation from derivations in ann-Regg to derivations in Reg*. 

Lemma 8.10 (from ann-Rego- to Reg*-derivations). Let V be a closed derivation 
in ann-Regg with conclusion () L : M. Then U(L)i, and T> can be transformed into a 
closed derivation V in Reg* with conclusion ()U(L) = ()M by: 

— replacing each formula occurrence o of (Ay) P: N in T> by an occurrence of the 
formula U(( Xy) letrec B in P) = (Xy)N in 2?', where B arises as the union of all 
outermost binding groups in conclusions of instances of (FIX) at or below o, and 
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where (Xy)P = (Xy)P[f := cj] and / is comprised of the recursion variables occurring 
in B and Cj are distinct constants for / as chosen by T>; the unfoldings involved here 
are always defined. 

Proof. Let T> be a closed derivation in ann-RegQ with conclusion () L : M . 



By Proposition 8.8 V can be transformed into a closed derivation T>\ in ann-Regj^ trec 
with the same conclusion. Due to Proposition |8.7| by dropping the infinite terms in 2?i, 
a derivation T>2 in Regj^ trec with conclusion ()L can be obtained. Then it follows from 



Theorem 6.24 that U(L)l, that is, that L unfolds to an infinite A-term. 

We have to show that the transformation of T> into V as described in the statement of 
the lemma is, on the one hand, possible (that is, the unfolding of each prefixed A| e t re c-term 
is indeed defined), and on the other hand, that the prooftree T>' obtained by these 
replacements is indeed a valid derivation in Reg* . 

We argue for the possibility of these replacements and for their correctness locally, that 
is by carrying out the replacements from the bottom of T> upwards, thereby recognising 
for every replacement step that it is possible, and that it indeed produces a valid inference 
in Regp. 

As a typical example of the arguments necessary to establish this fact, we consider a 
derivation T> in ann-RegQ with in it an instance of (A) that immediately succeeds an 
instance of (FIX): 

[(Xxy) c u :M ] u 

(Xxy) L [u := c„] : M 

— b IX, u 

(Xxy) letrec u = Lq in u ■ Mq 

(Xx) (Ay. letrec u = L in u) : Xy.M 
()L:M 

According to the statement of the lemma, T> is transformed into the following Reg*-prooftree: 



...(U((Xxy) letrec B , u = L , B' in u) = (Xxy)M ) u ... 

U((Xxy)\etrecB , u = L \n L ) = (Xxy)M 

: b lA, u 

U((Xxy) letrec B , u = L in m) = (Xxy)M 

^ A 
U((Xx) letrec B in Xy. letrec u = L in u) = (Xx) Xy.M 

W(()letrec in L) = ()M 

where B arises as the union of all outermost binding groups in conclusions of instances 
of (FIX) strictly below the visible instance of (FIX), and B' is the union of all outermost 
binding groups in conclusions of instances of (FIX) strictly above the visible instance 
of (FIX) and below the indicated marked assumptions (this binding group differs for 
different marked assumptions of this assumption class) , and where L is the result of 
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replacing in Lq all occurrences of constants c/ by the recursion variable / from which it 
originates. 

Now assuming that the unfolding in the conclusion of the visible instance of (@) has 
been shown to exist, we want to recognise that this instance and the instance of (FIX) 
above are valid instances in Regg. For the instance of (A) we have to show: 

U((Xx) letrec Bq in Ay. letrec u = Lq in u)i 

==> for a term Xxy.N : U((Xxy) letrec Bq, u = Lq in L )i = (Xxy)N 

a U((Xx) letrec Bq in Ay. letrec u = Lq in u) = (Xx) Xy.N 
This, however, is an easy consequence of the following -^ v -rewrite steps: 

(Xx) letrec Bo in Ay. letrec u = Lq in u -^ v .a (Xx) Ay . letrec Bq in letrec u = Lq in u 

^v.ictrec (Xx) Xy. \etrec B ,u = L in u 



in view of the fact that, by Lemma 3.16 unfolding is unique normalising (in at most to 
steps). And for the instance of (FIX) we have to show: 

U((Xxy) letrec Bq, u = Lq in u)i => U((Xxy) letrec Bq, u = Lq in Lq)1 = (*) 

=(*) a U((Xiy) letrec Bq, u = Lq, B' in u)i = (*) 

(actually the statement as in the second line has to be shown for every binding-group B' 
that occurs for marked assumptions discharged at the instance of (FIX)). This implication 
is a consequence of the -»- v -rewrite steps: 

(Xxy) letrec Bq, u = Lq in u -> v . rcc (Xxy) letrec Bq, u = Lq in Lq 
(Xxy) letrec Bq, u = Lq, B' in u ^ v . rc d (Aiy) letrec Bq, u = Lq in u 

again in view of the statement of Lemma |3.16| 

The arguments used here are typical, and can be carried out similarly also for showing 
that axioms (0), and instances of rules (@) and (S) in ann-RegQ-derivations give rise 
to, under the transformation described in the statement of the lemma, valid instances of 
axioms (0), and instances of (@) and (S), respectively, in Reg*-derivations. □ 

Theorem 8.11. If i- an n-Reg + () L: M holds for a A| etrec -term L and an infinite A-term M, 
then L unfolds to, and hence expresses, M. 

Proof. Suppose that T> is a closed derivation in ann-Regg with conclusion () L: M. 
Lemma |8. 10| entails that L unfolds to an infinite A-term, and moreover, that T> can be 
transformed into a closed derivation V' in Reg* with conclusion ()U(L) = ()M. Then it 
follows by Theorem 6.17 (applying soundness of Reg with respect to the property of 
Aietrec-terms to unfold to an infinite A-term that U(L) = M , and hence that L -»■" M . In 
this way we have found a A| e trec-term L that expresses M. □ 

We now arrive at our main characterisation result. 



Theorem 8.12. An infinite A-term is A| e t re c-expressible if and only if it is strongly regular. 
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Proof. Let M be an infinite A-term. The direction "=>" is the statement of Theorem 6.26 
For showing the direction "«=" in the statement of the theorem, we assume that M is 
strongly regular. 

Then by Lemma 6.13[ (fii I , there exists a closed derivation T> in Reg + with conclusion 
()M. Due to Lemma 8.4 (ij, T> can be transformed into a derivation f> in ann-Rego 



with conclusion () L: M, for some A| etr ec-term L. Then it follows by Theorem 8.11 that 



the A| e trec-term L expresses M. □ 

As an immediate consequence of Theorem |8.12| and of Theorem |7.13| we obtain the 
following corollary, a summary of our main results. 

Corollary 8.13. For all infinite A-terms the following statements are equivalent: 

(i) M is A| etr ec-expressible. 

(ii) M is strongly regular. 

(iii) M only contains finite binding-capturing chains. 



9. A-transition graphs 

In this section we introduce the concept of A-transition graphs. A A-transition graph Q 
of a term M can be understood as a nameless graphical representation closely related 
to the term graph of M in de-Bruijn notation. It is a graph that corresponds to the 
sub-ARS that is induced by M with respect to some scope + -delimiting strategy for Reg* , 
but where no information can be extracted from the objects. Consider, for example the 
sub-ARSs displayed in Figure [8] and Figure [9] but ignore the prefixes by which the nodes 
are annotated. To capture the notion of 'forgetting' the term associated with each object 
of the ARS we use the formalism of labelled transition systems, in which only transitions 
are observable (see Section [2]). 

We will show a coinduction principle for infinite A-terms: two A-terms are equal if and 
only if they have bisimilar A-transition graphs. 

Definition 9.1 (transition systems induced by CRSs). Let A = (A, $, src, tgt) be 

a sub-ARS, or a sub-ARS of a labelled version, of an ARS that is induced by a CRS C 



with rules R (see (Terese, 2003 11.2.24) for a definition of induced ARSs). In particular, 
every step in $ carries information according to from which rule of C it stems from. By 
the LTS induced by A we mean the LTS Cj± = (A, R, ») with transitions 

>* := {(a, p, a') \ (3(f> e $) ip : a -*■ a' a step that stems from rule p) 

in which the steps in A according to rule p are interpreted as transitions with label p. 
And for a subset Rq of R, by the LTS induced by A with silent R -steps we mean the 
LTS C AiRo = {A,R, »') with 



{(o >ft o')|(306*)^ 



a -»R ' a ' where -»,r are steps w.r.t.^ 
ules in Rq, and -* p is a step w.r.t rule p e R\R i 



in which the steps in A according to rules in Rq are interpreted as silent transitions, and 
the remaining rules as transitions according to their name. 



Expressibility in the Lambda Calculus with letrec 



65 



Definition 9.2 (transition graph of an object). If a is an object of the ARS A that 
is induced by a CRS C with rules R, and {A, R, >-*■) = £( _») the LTS induced by (a-»), 
then we call QaW) '■= (S, A, a, »■) the transition graph of a. 

For an LTS C( a ^.) t R = {A, R, >->■} with silent i?o-steps, we call GA,B. ( a ) := (S, A, a, >*) 
the transition graph of a with silent Ro-steps. 

Definition 9.3 (A-transition graph). We call a labelled transition graph Q = (S, A, x, ») 

a X-transition graph if: 

— it is connected 

— the labels are A = {A, S, @ , @i} 

— there are no infinite paths in Q consisting solely of S-transitions 

— every state belongs to one of the following kinds: A-states, S-states, and ©-states, 
where 

— a A-state s is the source of precisely one A-transition, and no other transitions: 
{(I, t) | (s, I, t) £ w} = {(A, t)} for some t e S. 

— a S-state s is the source of precisely one S-transition, and no other transitions: 
{(I, t) | (s, I, t) e «•} = {(S, t)} for some t e S. 

— a ©-state s is the source of precisely one @o-transition and one @i-transition, but 
no other transitions: {(I, t) \ (s, I, t) e >-*■} = {(@o, t), (@i, u)} for some t,u e S. 

Proposition 9.4. Let § + be a scope + -delimiting strategy of Reg + . For every term M e 
rer((A°°)) the transition graph Qg+(M) of M is a A-transition graph. 

Proof. In transition graphs Q§+(M), infinitely many successive S-transitions are not 
possible because in the ARS that induces Gs+(M), the rewrite relation S is terminating, 



due to Proposition 4.8 mvh. □ 



Along the lines of Proposition [94] we can also view transition graphs of A| e trec-terms as 
A-transition graphs, but only when treating unfolding steps as silent transitions. As hinted 



before in Remark 5.7 here the restriction of scope-delimiting (and scope + -delimiting) 
strategies to ones that prevent indeterminism in the application of unfolding rules is 
relevant. 

Proposition 9.5. Let S + be a scope + -delimiting strategy of Regl etrec . For every term 
L e Ter((A| etrec )), the transition graph 5s+ i ij v (i) of L is a A-transition graph. 

Definition 9.6 (A-transition graph of a term). 

(i) Let M e Ter(A°°). For a scope 4 " -delimiting strategy § + of Regf etrec we call the transition 
graph <7§+(M) the X-transition graph of L with respect to § + . And more generally, by a 
X-transition graph of M we mean a transition graph that is bisimilar to the transition 
graph of M with respect to a scope + -delimiting strategy S + . 

(ii) Let L e Ter((A| etrec )) be a (prefixed) S + -productive A| etrec -term. For a scope + -delimiting 
strategy S + of Regl etrec such that L is S + -productive, we call the transition graph 
Gs+^iL) the X-transition graph of L with respect to S + . And more generally, by a 
X-transition graph of L we mean a transition graph that is bisimilar to the transition 
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graph of L with respect to a scope + -delimiting strategy S + with the property that L 
is § + -productive. 

For prefixed A-terms in Ter((A°°)) and in Ter(A| etre c) we use the terms 'A-transition 
graph' and 'transition graph' synonymously. We also speak of A-transition graphs of terms 
L € Ter (A| e t r ec) or M € Ter(A°°) by which we refer to the A-transition graphs of ()L and 
()M, respectively. 

Theorem 9.7 (coinduction principle for A°°). For all infinite A-terms M and N the 
following statements are equivalent: 

(i) M = N. 

(ii) H EQ ~ M = N. 

(iii) M and N have bisimilar A-transition graphs. 



Proof. In view of Proposition 0.15[ (JTTJ) , the logical equivalence between ^ and (JTTJ) , and 



the fact that |i} => (JTTTJ) clearly holds, it suffices to show that ( |iii| ) => (JTTJ) holds. 

For this, suppose that Q§+ (M) and Q§+ (N) are bisimilar for some scope + -delimiting stra- 
tegies and for Reg. But now bisimilarity of these transition graphs guarantees that 
a derivation V in EQ°° with conclusion ()M = ()N can be constructed such that all 
threads in T> correspond to -»■§+ -rewrite sequences on M and to ->§+ -rewrite sequences 
on M, respectively. If the construction process is organised in a depth-fair manner (for 
example, all non-axiom leafs at depth n are extended by appropriate rule instances, 
before extensions are carried out at depth greater than n), then in the limit a completed 
derivation 2?°° with conclusion ()M = () N is obtained. This establishes i-eq~ M = N. □ 

Conjecture 9.8 (coinduction principle for A| etr ec)- For all L\,L<2 e Ter(A| etre c) it 
holds that L\ = L2 if and only if L\ and L2 have bisimilar A-transition graphs. 

Remark 9.9 (only Reg + defines nameless representations). For the concept of 
'A-transition graph of a term' that is defined, in analogy with Definition |9.6[ as transition 
graph of a term with respect to scope-delimiting strategies (instead of with respect to 
scope + -delimiting strategies), a similar coinduction principle does not hold. 

To realise this, consider the sub-ARS § e ag of Reg, induced by the eager scope-delimiting 
strategy for the term Xxy.xxy, in Figure [7] The corresponding LTS can be obtained as 
the LTS corresponding to a sub-ARS of Reg induced by an appropriate scope-delimiting 
strategy for each of the following four A-terms: 

Xxy.xxx Xxy.xxy Xxy.yyx Xxy.yyy 

For both of the terms in the middle the eager scope-delimiting strategy can be chosen, 
but not for the terms on the left and on the right. 

The understanding of A-transition graphs as nameless representations of an infinite 
A-terms implies that from a such a graph the corresponding A-term can be extracted. We 
define a function for this purpose by means of a CRS which implements the assembly 
of a A-term from the infinite unfolding of a A-transition graph. The function is closely 
related to the Parse* in the sense that Parse* does both destruct and reconstruct the 
scrutinised term while read back only implements the reconstruction. 
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Definition 9.10 (readback). 

readback : Ter oo ({0, A, @, S}) - Ter(\°°) 

t <-* readback(i) := infinite normal form of rwo(t) 
w.r.t. the following CRS: 

rw n (X 1 , . . . ,X n , A(t )) -> abs([x]rw n+1 (X 1 ,,..,X n ,x,t )) 

rw„(X, @(*o,*i)) -+ app(abs([x]rw„(X,t )),abs([x]rw„(X,ti))) 

rw n+ i(X,x, S(t )) -> rw n (X,i ) 

rw n (Xi, . . . ,X n , 0) -> X n 

The function is partial because rw n is unproductive for infinite S-chains. That restriction 



comes forth accordingly in the definition of A-transition graphs (Definition 9.3). The 
function is thus complete on the subset of Ter°°({0, A, @, S}) that is obtained from 
unfolding a A-transition graph. 



10. Conclusion and Outlook 

In this work we have introduced a number of formalisms for relating infinite A-terms and 
finite terms in the A-calculus with letrec to each other. In the following we recapitulate 
the most important concepts briefly. 

We provide CRS signatures to define the set of infinite A-terms and the set of A| etre c- 
terms, which we connect by the CRS il v for unfolding A| e trec-terms to their corresponding 
A-term. To determine which A| e trec-terms have an infinite unfolding we identify productive 
Aietrec-terms. 

To characterise the set of A-terms for which there exists a corresponding A| e trec-term 
(such that the former can be obtained from the latter via unfolding) we establish a 
framework of formalisms for 'observing' A-terms coinductively. Firstly we introduce 
prefixed A-terms that enrich A-terms by an abstraction prefix. On the prefixed terms we 
define the CRS Reg + in which a rewrite sequence corresponds to a deconstruction of a 
term along one of its paths. In that sense a prefixed term (Xx)M can be understood 
as a 'suspended decomposition' which has not advanced into subterm M yet. Such a 
decomposition describes a path through the term by observations of the form ->@ , 
-*§j, -►deli where the latter delimits the scope + of an abstraction. 

Since there is some freedom as to where scope + -delimiters can be placed, we define 
scope + -delimiting strategies to formalise specific possible choices eliminating that freedom 
and thereby making the observations deterministic except for the forking into the left 
or the right subterm of an application. By means of scope + -delimiting strategies we can 
formulate two important concepts: strong regularity and A-transition graphs. 

The intuitive understanding of strong regularity is the property of a infinite A-term 
M that from M every 'sufficiently eager' scope + -delimiting strategy can only generate 
a finite number of terms. We then show that A| etrec -expressibility coincides with strong 
regularity. 

Every scope + -delimiting strategy defines a A-transition graph of a term which can be 
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viewed as a nameless graphical representation very similar to its term graph in de-Bruijn 
notation with the difference that S'-nodes are not restricted to occur only near leafs but 
can be shared by variables. The eager scope + -delimiting strategy yields finite A-transition 
graphs for strongly regular A-terms. 

We adapt the concepts of the CRS for observing terms, scope + -delimiting strategies, 
and A-transition graphs and apply them to A| e trec proving similar results as for A°°. 

We provide a proof system that is sound and complete for the notion of strong regularity 
and which admits finite proofs for strongly regular A-terms. We define an annotated 
version of the proof system which not unlike an attribute-grammar definition implements 
the extraction of a A| e trec-term L from a proof for term M in that system, such that L 
unfolds to the M. We show that every scope + -delimiting strategy induces a proof and 
that from a proof a corresponding history-aware strategy can be deduced, which suggests 
a similar correspondence between A-transition graphs and proofs. 

The following results are within reach but not worked out yet: 

— coinduction principle for A; etrec : For all L, P e Ter(A| etre c) it holds that L = P if and 
only if L and P have bisimilar A-transition graphs. 

— a proof system for unfolding equivalence of A| etre c-terms 

— a thorough coinductive treatment of A-transition graphs and finality results 

— finite representations of regular A-terms as higher-order recursive program schemes 



(cf. Example 4.25 ) and their extractions from formalised proofs of regularity 

— characterisation of A| etrec -expressible preterms of infinite A-terms as those that can be 
generated, up to a-equivalence, by first-order recursive program schemes 

— a terminating readback function to extract A] etre c-terms from transition graphs 

We feel that in gathering these results we have gained a new perspective on the 
A-calculus with letrec. We expect be able to use the concepts and formalisms introduced 
here to obtain related results of the following kinds: 

— efficient decision of unfolding equivalence of A| etr . e c-terms by means of the (nearly 



linear) DFA-equivalence algorithm of Hopcroft and Karp (Hopcroft and Karp, 1971). 
a partial order for the amount of subterm sharing in a A| e trec-term leading to 

— a notion of maximal sharing for A| etre c 

— an efficient mechanism to derive the maximally shared form of a A| e trec-term which 
generalises common subexpression elimination 

a higher-order rewriting framework founded on A| e t re c instead of the A-calculus 
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Appendix A. Proof: Confluence of letrec-unfolding 



Proof of Proposition 3.11 (on page 11) 



To show confluence of i? v we use 'confluence 



by decreasing diagrams' (Terese, 2003 Sec. 14.2). Instead of a direct proof we prove 



confluence of parallel -Ry-steps, which are also indexed with their letrec-depth at which 
they are applied. The letrec-depth of a redex in A| e trec-term denotes the number of let„- 
nodes passed on the path from the root of the term tree to the corresponding position. 
We write <\>^ to denote rule cf> confined to redexes at letrec-depth d. 

We will show that two diverging parallel steps in i? v can be joined in an elementary 
diagram of the form 



v 

= 4>e-l 



e-1 



with d < e. If we pick as the order on the steps the order that is induced by their letrec- 
depth, the diagram is decreasing. Note that in the diagrams of this proof in we implicitly 
assume the reflexive closure for all arrows. To justify the diagram we distinguish the cases 
d = e and d < e. 

For d = e we need to consider parallel diverging (j)- and V'-steps contracting redexes 
at the same height d. We construct the diagram below whose outer arrows constitute a 
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particularly simple instance of the above diagram. We write terms as fillings of a multihole 
context C with all its holes at letrec-depth d such that the contracted and f/Vredexes 
are filled into these holes. In this way we can make explicit at which position a step takes 
place. The topmost row and the leftmost column are sequcntialisations of the parallel 
diverging (f>d- and i/'d-steps, respectively. 



C[L , . . .,L n ] 



4>d 



C[L' Q , L 1 . . . , L n \ 



L[,L 2 . .., L n ] 



<Pd . <Pd . 

C[Lq, L 1 , . . . , L n ] > C[L® ,L 1 , . . . , L n ] > C[Lf , L[,L 2 , 



C[L%,L°,L 2l . . . , L n ] 

ipd 



ipd 



ipd 



C[-^0'-^i' ■■■■> L n ~\ > C[Lf , Lf,L 2 , . . . , £„] 



C[Lq, . . .,L' n ] 



ipd 



> C[Lq, Lf , L 2 , ... , i^] 



V>d 



"0d = 



C[£g,...,i°] Y >C[L°,L°,...,L°] ..r*>C[L°,L?,I%,...,L°] ™> ■■> C[L®, . . . , L®] 

Only the tiles on the diagonal require closer attention because for all other tiles the 
vertical and horizontal steps take place in different holes of the context, therefore they are 
disjoint and consequently commute. In the tiles on the diagonal the diverging steps may 
be due to a critical pair, but since all steps take place at the same letrec-depth any such 
critical pair must arise from a root overlap. Below we scrutinize all diagrams arising from 
joining such critical pairs. Each of these diagrams conforms to the tiles on the diagonal. 
Note that the letrec-depths of the steps have to be increased by d according to the lifting 
into a context with its hole at letrec-depth d. 



letrec in Xx.L 
V-nil 



V.A 



Ax. letrec in L 
V.nil 



letrec B in Xx.L 
V.red 



V.A 



Ax. letrec B in L 

V-rcd 



Xx.L 



Xx.L 



letrec B' in Xx.L ■ ■■■■> Ax. letrec B' in L 
V.A 



letrec B'mLP 



letrec in LP 

V.nil 

I 

LP 



(letrec in L) (letrec in P) v.rcd 
- v.nil 

V 

LP 



letrec B'mLP -■■> 
v.@ 



(letrec B in L) 
(letrec B in P) 



= v-rcdo 



(letrec B' in L) 
(letrec B' in P) 
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V-reco 

letrec B in f t > letrec B in L; v.letrec 

letrec in letrec B in L > letrec B in L 



V-redo 



V-redo 



letrec B' in fa > letrec B' in Li 

V-rec 



V.nil 
letrec B \nL 



letrec B in L 



V-letreco V-redo 
letrec B in letrec C in L > letrec i? C in L letrec B \nL >• letrec B' in L 



V-redo 
letrec B' in letrec C in L 



V-rcd v-redo 

Y | 

> letrec B' C in L letrec 5° in L 

V-letreco v-rcd 



V-redo 
> letrec B' in L 



C[L , . . . ,L n ] 



For d < e we use the same approach as for d = e, the diagram is however more involved. 
Again, we use a context C with context holes at letrec-depth d. But since e > d, more than 
one f/'e-redex can be filled into a hole. Therefore a per-hole partitioning of the vertical 
steps requires a sequence of parallel steps. 

4>d 



C[L%Li,...,L n ]^ -t>C[L®,L 1 ',...,L n ] ^ > C[L®, L[, L 2 , . . . , L n ] 9d > <Pd - > C[L°,L{, . . . ,L' n ] 



C[L' , L x . . . , L n ] 

y 

~i>e-l 



C[Lq,L' 17 L 2 . . . ,L n ] 



e-1 



C[Lg, L° L 2) . . . , L„] ^ > f > C[L°, L° L 2 ,..., L„f ^ f > C[L°,Lf, L 2 , . . . , L n ] ?*> - - - 9d > C[L°, Lf,L 2 ,..., L' n ] 



,1pe-l 4>d 



e-1 



Ve 



1pe = 



C[Lq, . . . ,L' n ] 



e-1 



= V>e 



-Ipe 



~ll>e 



C[L°, ;..,L°] t! ' : -'> f >C[L°,I,° ...,L°] '- V -> %C[L°,Lf,L°,...,L°] 



1pe = 



i>e-l 4>d 



= lpe 
V 

~i>e-l 

V 

C[L®, . . . 



This diagram fits the scheme of the elementary diagram when interleaving the ip e - with 
the V'e-i-steps in the rightmost column such that steps at depth e preceed those at depth 
e-1. This is possible since the segments represent contractions within different holes of C 

As in the previous diagram the tiles which do not lie on the diagonal are unproblematic, 
which leaves us to complete the proof by constructing the tiles on the diagonal. 

Every hole is filled with at most one </>d-redex but because of d < e with possibly many 
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"0 e -redexes; there may or may not be an overlap between the 4>d~step and a i/> e -step, but 
there can be at most one, which is due to the rules of R v (see Figure 22). 




Fig. 22. Two exemplary redex constellations for d < e. Both have one (^-redex and 
three Vvredexes. On the left, one of the ip e -tedexes overlaps with the </>d-redex 
while on the right all ^ e -redexes are strictly nested inside the i^-redex. 



At the end of the proof is a comprehensive scrutinisation of all critical pairs arising 
due to a non-root overlap, as well as of all situations with nested redexes, with diagrams 
of the following schemes, respectively: 



'(/'I 



Wo 



4>e 



tpe> e'e{e,e-l} 



When lifted into a context of letrec-depth d both of the diagrams comply to the shape 
necessary for the diagonal tiles, but we need to be able to handle situations as on as 
on the left of Figure [22j where both nested redexes as well as the overlapping redex 
are contracted. Firstly, since all '(/'-redexes occur at the same letrec-depth, it must hold 
that d = and e = 1. Secondly, none of the involved redex contractions affect any of the 
nested redexes except for duplicating or erasing them, which means that the residuals of 
the V'-steps after these steps are part of a a parallel ?/v-step (mind that we assume the 
reflexive closure of all steps). Or in a diagram: 
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1>i 



f 

= Ve„ e,e{0, 1} 



The diagram is composed from the previous two diagrams. A parallel version of the right 
one constitutes the top part, while the bottom part is an exact replica of the left one. 
The top part settles the portion arising from the nested redexes, the bottom part settles 
the portion arising from the ovelapping redex. 

At last in order to fit that diagram into the scheme of the diagonal tiles the steps on 
the right have to be reordered such that tp ei -steps with a = 1 preceed ip ei -steps with 
d = 0. The reordering is viable because every ^> ei -step takes place in its own residual of 
the Vi-step from the left. 

We conclude the proof by a comprehensive analysis all critical pairs that arise from 
non-root overlaps in ii v , and after that the diagrams for joining nested redexes: 

V-letrcco 

letrec B in letrec C in Xx.L >■ letrec B C in Xx.L 



v-Ai 

y 

letrec B in Aa;. letrec C in L 
v-A 



v.A 



Ax. letrec B in letrec C in L > Ax. letrec B C in L 

V-letrcco 



letrec B in letrec Cm LP 



V-letrecp 



V.©i 



letrec B in (letrec C in L) (letrec C in P) 



letrec B C in LP 



V.(Q>o 



(letrec B in letrec C in L) 
(letrec B in letrec C in P) 



V-letrecp 



(letrec B C in L) 
(letrec B CinP) 
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V-letreco 

letrec B in letrec C in /» > letrec B C in /j 



V-rcc! 



V-rcc 



letrec B in letrec C in Li > letrec -B C in Lj 

V-letreco 

V-letreco 

letrec B in letrec C in letrec B in L > letrec B in letrec C D in L 



V-letreci 



V-letreco 



letrec I! ( ' in letrec B in L > letrec B C D \n L 

V-letreco 

, , V-letrecn 

t-j - I . V-letreco letrec B in letrec C in L > letrec B C in L 

letrec B in letrec in L > letrec B in L 



V-nili 
letrec B in L 



V-redi 



V-redo 



, n> ■ r letrec B in letrec C" in L , > letrec B C in L 
letrec B in L y.letrec 



Diagrams for joining nested redexes: 
V.A 



letrec B in Xx.L 



\x. letrec B in L letrec B in L Li 



V.@0 



* (letrec B in L ) (letrec B in L\) 



letrec B' in Xx.L' Ax. letrec B' in X' letrec B' in L L[ ■-■■■> (letrec B' in L' ) (letrec B' in Li) 

v.A v.@ 

V-rccg 

letrec Bin/ » letrec B in L 



letrec B' in L letrec B in letrec C in L 

V — 

letrec B' in f > letrec B' in L' letrec B' in letrec C in L' 

V-rcco 

V.nil 
letrec in L * L 



V-letrecg 



letrec B C in L 

VV e'€{e-l,e} 



> letrec B' f " in /.' 

V-lctrecg 



V-redo 

letrec B in L * letrec B* in L 



e-1 



1p e 



letrec in L' .,-> V letrec B° in L' , > letrec B® in L' 

V-nil v.red 



This concludes the proof. 



□ 



